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Chapter 1 


Introduction 


In 1450, Johannes Gutenberg finished the invention of the printing press 
and thereafter books became widely available. This development increased 
the importance of ideas and information in their own right. The last 
decades of the fifteenth century marked a period of very intense explo- 
ration of many kinds. The academic and intellectual developments that 
occurred during that time period exemplify the Renaissance era. advanced 
calculus 


The Renaissance, as it occurred in Europe, was a virtual rebirth of modern 
society, a resurrection of European thought and culture, after more than 
one thousand years of oppression imposed during the Medieval period of 
history. Many artists and intellectuals, in Italy particularly, rediscovered 
past treasures, and began building upon them. As a result, science and 
mathematics began to reawaken from the thousand-year sleep as well. 
special functions 


As the sixteenth century began, Europe felt that it was on the verge of 
many great things. In the field of mathematics, many new shores began 
to appear within sight. For example, the solution of quadratic equations 
had been achieved long before, the quadratic formula having been known 
since Babylonian times (ca. 1800-1600 BC). But now mathematicians 
began to seek, and contemplated whether it was even possible to find, a 
general solution to a cubic equation, in other words, an equation of the 
form az? + br? + cz + d= 0. the gamma function 
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The quest ultimately led to the recognition of complex numbers, upon 
which a great deal of modern mathematics, and much of the mathemat- 
ical theory behind modern science and technology, is ultimately based. 
Complex analysis ranks amongst the most beautiful, elegant, subtle and 
powerful of mathematical disciplines. statistical mechanics 


In modern times, we typically simply point out that the quadratic equa- 
tion x? + 1 = 0 has no solutions in the field R of real numbers, because 
every sum of squares p? + 1 with p € R is positive, and we use that as our 
call for the existence of complex numbers. But due to the uncomfortable 
unfamiliarity with such numbers in the sixteenth century, an indirect, 
but more compulsory path was followed, involving cubics. A new era in 
mathematics was eventually inaugurated by the final recognition that the 
inherent incompleteness of R could be remedied through an extension to 
the field C of complex numbers. abstract algebra 


The development of the theory of complex numbers is a striking chapter 
in the history of mathematics. Complex numbers made their first appear- 
ance during the Renaissance, at which time they were often referred to 
as impossible quantities-even negative numbers were yet regarded with 
some skepticism, a consequence of the then prevalent reliance on classical 
geometric thinking (which does not admit negative lengths). A precise 
foundation for the theory of complex numbers would not be laid until the 
end of the eighteenth century. The quantity i = ,/—1, whose square i? = —1 
is negative, remained suspect and abstruse. number theory 


Two positives make a positive; a negative and a positive make a negative; 
two negatives make a positive. Why is this? A good common sense explana- 
tion is as follows: If one says "do eat", it is encouragement to eat (positive), 
but if one says "don’t eat", the opposite (negative) is being said. Now if one 
says "don’t starve" where "starve"="not eat", one is back to saying "eat". So 
two negatives make a positive. But the key observation here is that (++) 
and (——) both give (+). So how can a negative have a square root? college 
algebra 


One crucial observation is that positing the existence of \/—1 is not equiva- 
lent to actually taking the square root. It is impossible to actually calculate 
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the square root of —1, contrary to what the suggestive symbol \/—1 might 
seem to imply. But that does not mean \/—1 does not exist. Although it is 
impossible to calculate \/—1, it nonetheless exists, and we can calculate 
with the quantity. Let’s look at some examples of how \/—1 makes its pres- 
ence felt: two mathematical, one from physics and one from electronics. 
calculus 


Example 1.1. The quantity In(1) is pure imaginary. A real function with 
real argument can have a complex result: For the quantity In(1) is pure 
imaginary. But one must be willing to admit complex numbers to see this. 
If z € R then it is always the case that 


(x) e™@) =z and (xx) In(e?) =z. 


If z is complex, (x) is unremarkable, but (xx) merits comment. For suppose 
we take into consideration Euler’s identity e’” + 1 = 0, or, in other words, 
e" = —1, which is a special case of Euler’s formula: 


e7 = cosx + ising Vc eR 


a result that can be easily seen by comparison of the Taylor series expan- 
sions of e”, sin x and cos z. 


In particular, with x = 7, by taking half a turn around the unit circle in C: 


e" = cost + isinm 


which gives the identity e" + 1 = 0. Now squaring both sides of e" = —1 
gives ec" = 1, and then using (x+) might seem to imply 


In(e?*") = In(1) = ln(e), 
but that would be wrong, that 277 = 0. 


Over the complex field C, 1 4 e°, but rather 1 = e?+?™, What we actually 
should write then is not (e’")? = 1, but rather (e’")?” = 1 which, when we 
use (xx), yields 


In(e*") = In(1) = i200 n = 0, 41, +2,.... 
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So, In(1) has a zero real part, but in writing In(1) = 0, we in effect only 
recognize one of an infinitude of possible purely imaginary values; namely 
the one with zero imaginary part. In general, In(1) is a complex quantity 
that is legitimately considered pure imaginary, always having zero real 
part. trigonometry a 


Example 1.2. Using complex mathematics to perform real mathemat- 
ics — Cauchy’s residue theorem applied to real integrals. Cauchy’s 
residue theorem is a powerful tool to evaluate line integrals of analytic 
functions over closed curves and can often be used to compute real inte- 
grals as well. Don’t panic if you don’t understand the material that follows; 
it is all clearly explained later in the book. This material is simply meant 
to serve as a preview. introductory physics 


If a complex function is analytic (syn. holomorphic) on a region R, it is 
infinitely differentiable in R. A complex function may fail to be analytic at 
one or more points through the presence of singularities, or through the 
presence of branch cuts. A single-valued function that is analytic in all but 
a discrete subset of its domain, in other words at isolated points, and at 
those singularities goes to infinity like a polynomial (i.e., these exceptional 
points are poles), is called a meromorphic function. f has a pole of order n 
at z ifn is the smallest positive integer for which (z — zo)" f(z) is analytic at 
zo. computers on sale 


Definition 1.1. A function f has a pole at z if it can be represented by a 
Laurent series centered about zo with only finitely many terms of negative 
exponent, i.e., 


oO 


fe)= > apla — zo)" 


k=—n 
in some neighborhood of zo, with a-n # 0, for n € N. The number n is called 
the order of the pole. A simple pole is a pole of order 1. 


Definition 1.2. The constant a_, in the Laurent series 


Co 


fe) = So alz- z) 


k=—0o 
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of f(z) about a point z is called the residue of f(z). The residue of a 
function f at a point z is denoted Res(f; zo). 


The residues of a function f(z) may be found without explicitly expanding 
into a Laurent series. If f(z) has a pole of order n at z, then a, = 0 for 
k<—nanda_, #0. diamond rings 


Thus, 
f(z) = > alz — z)* = Yo iae — x)" 
k=—n k=0 
(z — 20)" f(z) = > Ax—n(2 — zo)". 
k=0 
Differentiating, 


£ (z — 20)" f(2)] = ip kak-n(z — a 


= > kak-n(z — zo)! 
k=1 


= X (K+ 1)ak-n1(2 — 20)". 
k=0 


[(z — zo)" F(2)] = X k(k + 1)Jar-n1 (2 — 20)" 


k=0 


= So k(k + 1)ar-n1(2 — z0)" 


k=1 


(k+ 1)(k + 2)ak-n42(2 — z0)". 


Me 


> 
ll 
(= 
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Iterating, 


qn} : 
Fon (z — 20)" F(z)] 


— i(k + 1)(k+2)-+-(k+n—1)ag_1(z — z0)" 
= (n = 1)la_y 


+ b+ (6 +2) +m Danil- z) 


k=1 


so we have 


n—1 
ia r le- ae) 
= lim (n — 1)!a_1+0 
z= zo 
= (n — 1)!aı 


and the residue is 


ee dm 


ay = (n— 1)! dz! [(z = 20)" F2) 5 


Proposition 1.1. In the case that lim._,.,(z — zo) f(z) exists and has a non- 
zero value r, the point z = zo is a pole of order 1 for the function f and 


Res( f; zo) =f; 


which follows directly from the preceding discussion. mathematical meth- 
ods for physicists 


For example, we can determine the residue at a simple pole as follows: 


Tz+í , Tz+í 
Res —.—. = a= 
z=zz(z2+1) 29% z(z+i)(z —1) 
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Cauchy’s residue theorem is stated as follows: Suppose U is a simply 
connected open subset of the complex plane, and aj,...,a, are finitely 
many points of U and also suppose f is a function which is defined and 
analytic on U\{ai,...,a,}, which is to say f is a meromorphic function that 
is not analytic specifically at the discrete set of points {a),...,a,}. Ifyisa 
rectifiable curve in U which does not meet any of the ap, and whose start 
point equals its endpoint, then 


gre) dz = 2ri D I(y, ap) Res(f, ap). 


Here, Res(f, ap) denotes the residue of f at ax, while I(y, ap) is the winding 
number of the curve y about the point a,. This winding number is an 
integer which intuitively measures how many times the curve y winds 
around the point ax; it is positive if y moves in a counterclockwise manner 
around ap, negative if y moves around a; in a clockwise manner, and 0 if y 
doesn’t move around a, at all. group theory 


In order to evaluate real integrals, the residue theorem is used in the 
following manner: the integrand is extended to the complex plane and its 
residues are computed (which is usually simple), and a part of the real 
axis is extended to a closed curve by attaching a half-circle in the upper 
or lower half-plane, forming a semicircle. The integral over this curve can 
then be computed using the residue theorem. Often, the half-circle part of 
the integral will tend towards zero as the radius of the half-circle grows, 
leaving only the real-axis part contributing to the integral, the one we were 
originally interested in. linear algebra 


There will be a need for bounding the absolute value of complex line 
integrals. The formula is 


[fe dz 


where L is the length of C and M is a constant such that | f(z)| < M every- 
where on C. 


< ML (ML-inequality) 


Consider the integral 
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which arises in probability theory. Its solution is resistant to elementary 
calculus techniques, but can be evaluated by expressing it as a limit of 
contour integrals. tensor theory 


Suppose t > 0 and define the contour C that goes along the real line from 
—R to R and then counterclockwise along a semicircle centered at 0 from R 
to —R. Take R to be greater than 1, so that the imaginary unit i is enclosed 
within the curve. The contour is shown in Figure 1.1. The contour integral 


is given by 
eit 


Figure 1.1: Path C of the contour integral. 


Since e’ is an entire function (having no singularities at any point in the 
complex plane), this function has singularities only where the denominator 
z*4+1is zero. Since z2? +1 = (z+7)(z — i), that happens only where z = i 
or z = —i. Only one of those points is in the region bounded by the given 
contour. complex analysis 


Because f(z) is 
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According to the residue theorem, then, we have 


—t 
e —t 


f f(z) dz = 2ri - Resf(z) = 2ni— = me 
c z=i 2i 
The contour C may be split into a straight part and a curved arc, 
R 
f f(x) d+ | fe) dz = te™ 
-R s 


and thus 


T, Ions fto PA 


Using some estimations, we have 
itz 

[e] 

s2’ +1 s 


1 1 TR 
< | —=—ldz|< | = ldz| = =~ —> 0as R 
< | gls | aiel op? 0a R> 


Therefore, for t > 0, we have 


lev) itz 
e —t 
f a de = Te : 
eter 


If t < 0 then a similar argument with an arc C” that winds around —i rather 


than 7 shows that 
ee) ettz F E : 
i CEN) Z= Te, 


co itz 
€ =|t| 
3 dz = re ". 
ae 4+1 


If t = 0 then the integral can be performed using elementary calculus 
methods and its value is 7. functional analysis 


itz le | 


s |2? + 1| 


|dz| = 


|dz| 


z2? +1 


so we conclude that 


Next we consider real, improper integrals of rational functions f(x) of the 


type : 
J f(a) de, 
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where it is assumed that the degree of the denominator of f(x) is at least 
two units higher than the degree of its numerator. 


Consider the contour integral corresponding to the above mentioned im- 
proper integral of a rational function, 


$ f(e) dz, 


around the semicircular path C, similar to that shown in Figure 1.1. Since 
f(x) is rational, f(z) has finitely many poles in the upper (or lower) half- 
plane, and if R is sufficiently large, then C encloses all these poles. By the 
residue theorem we obtain 


fie TE [te at fe f(E) de = 2ri ST Res f(2) 


where the sum consists of all the residues of f(z) at the points in the upper 
half-plane at which f(z) has a pole. From this we have 


a f(a) dx = 2ri X Res f(z) — [1o dz. 


Now if R — oo, the value of the integral over the arc S approaches zero. If 
we set z = Re”, then S is represented by R = constant, and as z ranges 
along S, the variable 0 ranges from O to 7. Since, by assumption, the degree 
of the denominator of f(z) is at least two units higher than the degree of 
the numerator, we have that 


k 
IF| < PE (z| = R > Ro) 
for sufficiently large constants k and Rọ. So by the ML-inequality, 


f f(z) dz 


As R > oo, the value of the integral over S approaches zero, and we arrive 
at 


k kr 


f f(x)dx = 2ri X` Res f(z), 
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where we sum over all the residues of f(z) corresponding to the poles of 
f(z) in the upper half-plane. vector analysis 


As an example, consider the following improper integral of a rational: 


[ dx 
0 1zi 


The complex function f(z) = 1/(1 + 2*) has four simple poles at the points 


a= aor Zo = eur 23 = em. z4 = eo t/4 


The first two of these poles lie in the upper half-plane. Through application 
of L’Hopital’s rule we find the corresponding residues to be 


z=z 1 
Ref) = [Era] = laal 


as =e 3ni/4 = — L et/4 


sin z = —(e” — 


we arrive at 


©% dx 2ri , , T 
Aney amia =ri/4\ __ DARE LE 
[ r= T ee nen = 


S> 


Since 1/(1 + zł) is an even function, we finally obtain 


[ dx =f. dr x 
o let 2j lee 2/2 
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Example 1.3. Quantum mechanical states are represented by vectors 
in a (possibly infinite-dimensional) complex vector space. Now we di- 
gress from pure mathematics to demonstrate the necessity of complex num- 
bers in describing the physical world. Here we consider the Stern-Gerlach 
experiment—an experiment involving a 2-state system which demonstrates 
the fact that quantum mechanical states are to be represented by vectors 
in an abstract complex vector space (note that functions, including wave- 
functions, can be thought of as infinite-dimensional vectors in the sense 
that each abscissa can be thought of as an "axis", while each corresponding 
ordinate value can be thought of as the value along that axis). We will 
finish by making an analogy with the polarization of light. Fourier analysis 


Quantum states are always vectors in an abstract vector space (techni- 
cally, a complex Hilbert space). For quantum wave functions, the Hilbert 
space usually has not only infinite dimensions, but uncountably infinitely 
many dimensions. Each wave function corresponds to a quantum state, 
which can be thought of as a vector in the state space, which is a Hilbert 
space. However, linear algebra is much simpler for finite-dimensional 
vector spaces. Therefore it is helpful to look at an example from quantum 
mechanics where the Hilbert space of wave functions is finite dimensional. 
In quantum mechanics, every finite-dimensional Hilbert space can be con- 
sidered a spin space. real analysis 


In formulating the Dirac equation for the electron, Paul Dirac combined 
relativity with the quantum wavefunction. He explained that quantum 
spin is ultimately a relativistic effect. Phenomenologically speaking, the 
effects to which the term spin refers will be explained below. 


As shown in Figure 1.2, in the Stern-Gerlach experiment, first, silver (Ag) 
atoms are heated in an oven that has a small hole through which some 
of the silver atoms manage to escape, which are then beamed through a 
collimator and subsequently subjected to an inhomogeneous magnetic field 
produced by a pair of magnetic poles, one of which has a very sharp edge. 
The magnetic field is chosen to be inhomogeneous such that the force on 
one end of the atomic dipole will be slightly greater than the opposing force 
on the other end, so that there is a net force which deflects the particle’s 
trajectory. dynamical systems 
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z-axis 


Beam direction 


To detector 
gem a tc 
oN 
Shaped Collimating 
Slit 
magnet 
(pole pieces) 


Figure 1.2: The Stern-Gerlach experiment. 


The silver atom is made up of a nucleus and 47 electrons, where 46 out of 
the 47 electrons can be visualized as forming an approximately spherically 
symmetrical electron cloud with no net angular momentum. Ignoring 
nuclear spin, the atom as a whole does have an angular momentum, due 
solely to the intrinsic (not orbital) spin angular momentum possessed by 
the 47th (5S) electron. The spins of the other 46 electrons cancel as far 
as atomic angular momentum is concerned due to the Pauli exclusion 
principle, which states no two electrons in an atom may possess the same 
quantum states, and Hund’s rule, which states that every orbital in a sub- 
shell is singly occupied with one electron before any one orbital is doubly 
occupied, and all electrons in singly occupied orbitals have the same spin. 
The 47 electrons are attracted to the nucleus, which is ~ 2 x 10° times 
heavier than an electron; as a result, the atom nets a magnetic moment 
equal to the spin magnetic moment of the 47th electron. That is to say, 
the magnetic moment p of the atom is proportional to the electron spin S, 
stated u x S (the proportionality factor is usually taken to be a negative 
number). differential equations 


The magnetic moment of a magnet is a vector quantity that determines 
the force the magnet can exert on moving electric charges and the torque 
that an external magnetic field would exert upon it. The magnetic mo- 
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ment has magnitude and direction. The direction of the magnetic moment 
points from the south to north pole of a magnet, an arbitrary convention. 
The magnetic field produced by a magnet is proportional to its magnetic 
moment. To be precise, the term ‘magnetic moment’ normally refers to a 
system’s magnetic dipole moment, which produces the first term in the 
multipole expansion of a general magnetic field. The dipole component of 
an object’s magnetic field is symmetric about the direction of its magnetic 
dipole moment, and decreases as the inverse cube of the distance from the 
object. differential geometry 


Now, u and B are defined such that the interaction energy of the magnetic 
moment with the magnetic field is just —w-B, so the z-component of the 
force experienced by our silver atom is given by 


0B. 


ð 
F, = —(u : B) S u 0, 
5, # ea! gz 008 


which mathematically states that the pull and push of the magnetic field in 
the z-direction upon the silver atom is proportional to its magnetic moment, 
to the ratio of the amount of rotation of the moment with or against the 
field to movement of the atom in the z-direction, to the degree of alignment, 
and to the degree of inhomogeneity of the magnetic field. Since the atom 
is much heavier than an electron, the classical concept of trajectory will 
suffice in this case (indeterminancy of position is not considered here). 


The silver atoms in the oven are randomly oriented; as is each p. If the 
electron were a classical spinning object, we would expect all values of ju, 
to be realized between |u| and —|,.|; in other words, we would expect to 
detect a continuous bundle of beams as shown in Figure 1.3a. But what 
we experimentally observe is that the Stern-Gerlach apparatus splits the 
initial silver beam from the oven into two distinct components as shown in 
Figure 1.3b — this phenomenon used to be known as "space quantization". 
Since u x S, only two possible values of the z-component of S are observed 
to be possible, S, up (spin-up) and S, down (spin down), labelled S,+ and 
S.—, respectively. Assuming the north pole of the magnet rests upon the 
table, situated below the south pole, the p, > 0 (S; < 0, or spin-down) atom 
experiences a downward force, while the u, < 0 (S, > 0, or spin-up) atom 
experiences an upward force. The beam splits according to the values of 
uz. The Stern-Gerlach apparatus measures the z-component of u and the 
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z-component of S up to a proportionality factor. The two possible values of 
S, are mulitples of some fundamental unit of angular momentum, which 
we get to momentarily. topology 


Screen Screen 


(a) (b) 


Figure 1.3: Beams from the SG apparatus; (a) is expected from classical 
physics, while (b) is actually observed. 


It turns out that every elementary particle has a specific, immutable 
value associated with it called the magnetic quantum number s, which we 
commonly call the spin. The magnetic quantum number can be viewed as 
a vector component of the total spin angular momentum S. In solving the 
Schrédinger equation, the total spin angular momentum for a spin-1/2 


particle is given by 
1/1 V3 
S = Gerr 


The value of the reduced Planck constant A is: 
h 
i= oe 1.0546 x 10-4 - s = 4.1357 x 10°’ eV-s 
T 


The Planck constant has dimensions of physical action, which are the 
same as those of angular momentum (i.e., L = r x p=rxmv); or in other 
words, energy multiplied by time, or momentum multiplied by distance. 
The magnetic quantum number, viewed as a vector component of the total 
spin angular momentum 5S along one axis, can only have the values +h/2. 
In terms of natural constants, these values for angular momentum are 
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functions only of the reduced Planck constant, with no dependence on 
mass or charge. 


The necessity of introducing half-integral spin goes back to the results 
of the Stern-Gerlach experiment. Since the beam split in two (there is 
no undeflected beam), the ground state therefore could not be integral, 
because even if the intrinsic angular momentum of the atoms were as small 
as possible, 1, the beam would be split into three parts, corresponding to 
atoms with L, = —1, 0, and +1. The conclusion was that silver atoms had 
net intrinsic angular momentum of 1/2. algebraic geometry 


In the case of the electron, the two possible values of S, are multiples 
of the fundamental unit of angular momentum, namely S, = h/2 and 
—h/2. This quantization of electron spin angular momentum is the first 
important property deduced from the Stern-Gerlach experiment. The 
other property we now embark on deducing demonstrates the importance 
of complex numbers in describing the natural world at a fundamental level. 


Consider that, in the Stern-Gerlach apparatus, we could just as well apply 
an inhomogeneous field in a horizontal direction, say in the x-direction, 
with the beam proceeding in the y-direction. Then the beam would sep- 
arate into S,+ and S,— components. So now let’s consider a sequential 
Stern-Gerlach experiment, as shown in Figure 1.4, in which the atomic 
beam passes through two or more Stern-Gerlach apparatuses in sequence. 
Let a symbol such as SGZ stand for an apparatus with an inhomogeneous 
magnetic field in the z-direction, and so on. graph theory 


Sequential SG case 1: We have the atomic beam pass through an SGZ 
apparatus, but then black the S,— component, yet have the S,+ component 
beam again pass through another SGZ apparatus, from which only an S,+ 
component emerges. No S,— component emerges. There is no surprise in 
this result. 


Sequential SG case 2: Have the atomic beam pass through an SGZ 
apparatus, and again block the S,— component, but then pass the S,+ 
component beam through an SGĉ apparatus from which an S++ compo- 
nent beam and an S,— component beam emerge. Again no surprise. 
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S.+ comp. 
No Sz- comp. 


Figure 1.4: Sequential Stern-Gerlach experiments. 


Sequential SG case 3: Once again have the atomic beam pass through 
an SGZ apparatus, and block the S,— component, pass the remaining S,+ 
beam through an SGĉ apparatus and block the S,,— beam. Now, when the 
remaining S,+ beam is passed through an SGZ apparatus, both an S,+ 
beam and an S,— beam emerge! That result is quite surprising and is the 
reason complex numbers are necessary to describe nature fundamentally. 


Quantum mechanics says one cannot determine both S, and S, simul- 
taneously. The selection of the S,+ beam by the second SGé apparatus 
destroyed any previous information about S,. 


Now by making an analogy with the polarization of light, we shall see 
clearly that quantum mechanical states are to be represented by vectors 
in a complex vector space. partial differential equations 


Let’s consider a monochromatic light wave propagating in the z-direction. 
Linearly polarized (plane polarized) light with a polarization vector in the 
x-direction, call it z-polarized light for short, has an electric field oscillating 
in the x-direction through spacetime, 


E = Eo& cos(kz — wt). 
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x filter y filter 
(a) 
| | 100% | | | | 
x filter x’ filter y filter 
(45° diagonal) 


(b) 


Figure 1.5: Light beams subjected to Polaroid filters. 


We may likewise consider y-polarized light, also propagating in the z- 
direction, 


E = Eo cos(kz — wt). 


Polarized light beams of the above mentioned type are produced by passing 
unpolarized light through a polaroid filter, as shown in Figure 1.5. A 
filter that selects only beams in the z-direction is called an x-filter. An 
x-filter becomes a y-filter simply by rotating it 90° about the propagation (z) 
direction. If a light beam is passed through an z-filter, then subsequently 
through a y-filter, no light emerges. classical mechanics 


The experimental observation becomes quite fascinating if we insert be- 
tween the z-filter and the y-filter yet another polaroid that selects only 
light that is polarized in the z’-direction - making an angle of 45° with the 
x-direction in the zy plane, as shown in Figure 1.6. Now a light beam 
emerges from the y-filter despite the fact that after the beam went through 
the xz-filter it did not have any polarization component in the y-direction. 
Once the 2’-filter selects the x’-polarized beam, all previous information 
on light polarization is destroyed. Notice the analogy to the situation we 
encountered earlier in the sequential SG experiments, if the following 
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Figure 1.6: Orientations of the x’ and y’-axes. 


correspondences are made 


S,+atoms + z-, y-polarized light 
S, + atoms + 2’-, y/-polarized light 


where the z’- and y’-axes are at 45° angles with respect to the x- and y-axes. 


Now examine how we may mathematically describe the 45°-polarized beams 
(the z’- and y’-polarized beams) using only classical electrodynamics. Using 
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Figure 1.6 and the Pythagorean theorem we get 


Eo’ cos(kz — wt) 


1 1 
=F, aa cos(kz — wt) + —=ycos(kz — et) f 


9 


Eog’ cos(kz — wt) 


v2 v2 


In the triple-filter arrangement the beam coming out of the first polaroid 
is an x-polarized beam, which can be regarded as a linear combination of 
an x’-polarized beam and a y’-polarized beam. The second filter selects the 
x'-polarized beam, which can in turn be regarded as a linear combination 
of an x-polarized beam and a y-polarized beam. And finally, the third filter 
selects the y-polarized component. 


1 1 
= Eo -3 cos(kz — wt) + 9g cos(kz — wt) : 


The correspondence between the third sequential SG experiment and the 
triple-filter experiment suggests that we may be able to represent the spin 
state of a silver atom by a vector in an abstract two-dimensional vector 
space, not to be confused with the usual two-dimensional (xy) space. Just 
as ĉ' and 9’ are the base vectors used to decompose the polarization vectors 
ĉ and gy, it is reasonable to represent the S,+ and S,,— states by vectors, 
called kets in the notation of Dirac. We denote these vectors by |S}; +) and 
|Sz; —) respectively. So we are free to conjecture in analogy that 


1 1 
ISzi +) = Fe 18:5 +) Ja lO ) 
1 1 
|Sa; =) = Va |S; ) V2 |S; =) 


That this is the case is the reason that two components emerge from the 
third SGZ apparatus. 


But the next question is how to represent the S,+ states. By symmetry, 
an S,+ beam going in the x-direction passing through an SGy apparatus 
ought to be similar to the case where an S,+ beam goes in the y-direction 
and passes through an SG& apparatus. The kets for S,+ should then also 
be regarded as a linear combination of |S_;+), but from the two equations 
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above, it appears we have already exhausted the available possibilities in 
writing |S,;+). How can our desired vector space formalism distinguish 
Sy+ from S,+ states? 


An analogy with circularly polarized light helps here. Circularly polarized 
light can be obtained from linearly polarized light by passing it through an 
optical device known as a quarter-wave plate. A right cicularly polarized 
light is a linear combination of an x-polarized light and a y-polarized light, 
where the oscillation of the electric field for the y-polarized component is 
90° out of phase with that of the x«-polarized component: 


1 1 
E = Fo an cos(kz — wt) + va cos(kz — wt + 5) 
When circularly polarized light is passed through an v-filter or a y-filter, 
either an x-polarized beam or a y-polarized beam is obtained, of equal 
intensity. But circularly polarized light is completely different from the 
45°-linearly polarized (z’- or y'-polarized) light discussed. 


Consider Euler’s identity e" = —1 which implies i = e’/? and thus for 
circularly polarized light we can write 
1 ; i . 
pe ~ i(kz—wt) ~ i(kz—wt) 
E = | —=te + —=ye 
v2" 


where Re(c) = E/E) and we used Re(e”’) = cos@. We make the following 
analogy with the atomic spin states: 


S,+ atoms + right-, left- circularly polarized light 


Applying this analogy if we are allowed to make the coefficients preceding 
base kets complex, then there is no difficulty in accomodating the S,+ 
atoms in the vector space formalism: 


1 i 
. — - ard 
lS; +) Z [Say +) ae J kee 
Through these analogies, we see that the two-dimensional vector space 
needed to describe the spin states of silver atoms must be a complex vector 
space; an arbitrary vector in the vector space used to represent quantum 
mechanical states is a linear combination of the base vectors |S,; +) with, 
in general, complex coefficients. quantum mechanics a 
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Example 1.4. Use of ,/—1 in the field of electronics. 

Now we consider Rayleigh’s puzzle — a problem that concerns the study 
of alternating current circuits which was resolved in the late nineteenth 
century with the help of complex exponentials. 


Before discussing the puzzle, we briefly survey a few facts about electricity. 
Two hundred years ago, the common view of electricity was that of a mys- 
terious fluid flowing along a pipe of wire. Oliver Lodge derisively referred to 
this view as the "drainpipe theory" of electricity. For direct current flowing 
through purely resistive circuits the drainpipe theory is perhaps enough, 
but for alternating current is simply does not suffice. 


For many types of materials, specifically metals and carbon, the rate at 
which electrical charge moves through the material (the current) is directly 
proportional to the applied voltage. This phenomenon is referred to as 
Ohm’s law. Because it applies only to a limited number of materials, Ohm’s 
law is not a law in the same sense that, say, conservation of energy is. 


What is charge? Electrical charge is the property of matter that is the basis 
for electrical effects. Charge is quantized and occurs in multiples of the 
charge on one electron which is —1.6 x 1071? coulombs (the charge on a 
proton is +1.6 x 1071? coulombs). 


Coulomb’s electrostatic force law is somewhat analogous to Newton’s 
inverse-square law for gravitational attraction. One simply substitutes 
qı and q for mı and m, and uses Coulomb’s constant ke in place of the 
gravitational constant G. Similar to Newton’s law, the force is along the 
line between the two charges, but although masses are always positive 
and the gravitational force is always attractive, electrical charges can be 
either positive or negative such that the electrostatic force can be either 
attractive (qıq2 < 0) or repulsive (q,q > 0). Electrically neutral macroscopic 
matter remains full of charge at the microscopic level — equal amounts of 
positive and negative charge cancel each other’s effects. relativity 


Electrical current is the movement of charge. A current of one ampere is 
said to be flowing through a cross-section of a conductor if one coulomb 
of charge passes through the cross-section per second. The presence of 
an electric field is what makes a charge move. An applied voltage is what 
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creates the electric field in a conductor, and the moving charges are the 
current. If current could be likened to fluid moving through a pipe, then 
voltage would be analogous to the pressure. 


One of the most basic electrical components are resistors, which are com- 
monly small cylinders of carbon with wire terminals at each end. Resistors 
are defined to be any device that obeys Ohm’s law, V = IR, where V is the 
voltage difference across the two terminals, I is the current through the 
resistor, and R is the resistance of the device to current flow. R is measured 
in ohms if V and I are measured in volts and amperes, respectively. 


Ww = Gee 


0- — >” 
V=IR eo par” 
dt dt 


Figure 1.7: Three common components used in electronic circuits: from 
left to right, the resistor, capacitor and inductor, respectively. 


Two other basic components commonly found in electric circuits are the 
capacitor and the inductor. These electrical components obey the math- 
ematical relationships shown in Figure 1.7, where C and L denote the 
capacitance and inductance, in farads and henrys, respectively. The figure 
also depicts the schematic symbols used to denote these electrical compo- 
nents in circuit diagrams. string theory 


The current J in Figure 1.7 flows in the direction of the voltage drop, from 
the + terminal to the — terminal. The plus/minus symbols do not actually 
mean plus and minus - the + terminal is simply at a higher voltage than 
the — terminal is. Both terminals could be positive when compared to 
ground. For example, consider a +8 volt + terminal and a +2 volt — termi- 
nal in which the voltage drop is 6 volts as we move from the + terminal 
to the — terminal. Electric fields point in the direction of the voltage drop, 
from the + to the — terminal. However, the actual physical charge carriers, 
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electrons, with their negative charge will move in the opposite direction, 
against the electric field, from the — terminal to the + terminal. 


There are just two more facts about electricity to learn before we consider 
Rayleigh’s puzzle. Kirchoffs laws are two laws used in circuit analysis that 
are actually restatements of the conservation laws of energy and charge 
(neither energy nor charge can be created or destroyed) applied to the 
special case of electronic circuits. 


Voltage law: Around any closed loop, the sum of all the voltage drops 
across the components is zero. 


Current law: The current flux at any given point is zero, or the sum of the 
currents flowing into a point equal the sum of the currents flowing out of a 
point. 


1.25 ohms 
volts 


1.75 ohms 


Figure 1.8: A simple resistor circuit. 


Now, consider the circuit shown in Figure 1.8. The battery’s voltage of 
1.5V also occurs across both halves of the right-hand side of the circuit. 
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Kirchoffs current law demands that J = J, + I. From Ohm’s law we have 


Vic = 0051, 
Ve = 1.051 
Vae = 1.2513, 
Vas iTi 


And from Kirchoffs voltage law, we have 


1.5 = Va + Vig = 1.544 
1.5 = Vac + Vea = 3H 


So J, = 1A and Jy = 0.5A. Take special note of the fact that J; and J) are 
both less than J = 1.5A and would be measured to be so. 


Now we are ready to consider Rayleigh’s puzzle. We extend the simple 
circuit of Figure 1.8 to the one shown in Figure 1.9 where all three kinds of 
electrical components are included and instead of a DC battery we use an 
AC voltage generator operating at radian frequency w. This means that if 
V(t) is a sinusoidal voltage with a maximum amplitude of Vmax volts, then 
V(t) = Vinax cos(wt). scientific computing 


The voltage V(t) completes one cycle of oscillation as wt varies from O to 
27. Thus, one cycle of oscillation requires a time of 27/w seconds. In one 
second there will be w/2z7 cycles, which is the frequency f, so w = 27f. The 
unit of frequency is the hertz. One hertz is equivalent to one cycle per 
second. Since angular frequency w is the number of radians per unit time, 
the unit of angular frequency is taken to be the inverse second s~}. 


The three circles labelled M, Mı and M in Figure 1.9 represent current 
meters with indicator needles that point to current values on a scale. 


Current meters such as those used in this example are called moving coil 
D’Arsonval meters — they utilize the magnetic field generated by the current 
along a wire. Their principle of operation is quite simple. A coil of stiff wire 
is suspended via low-friction supports in the magnetic field of a magnet. 
The current to be measured is passed through the coil, and its magnetic 
field interacts with the magnet’s field to produce a net force; this produces 
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Figure 1.9: Rayleigh’s current-splitting puzzle. 


a torque that rotates the coil, which moves a needle attached to the coil. 
The force or torque is proportional to the coil current. If this current varies 
slowly enough, the coil/needle apparatus can vary along with the current, 
but if the current varies rapidly the mechanical inertia of the coil/needle 
apparatus will smooth out the variation such that the meter responds to 
the average value of the current. advanced engineering mathematics 


Supposing the meter’s magnet is a permanent magnet, the torque is pro- 
portional to the product of the fields, which is directly proportional to 
the current in the coil. Such a meter would be useless for measuring AC 
currents since their average value is zero. But if we replace the permanent 
magnet with an electromagnet (a second coil wrapped around an iron 
bar) and run the current through it as well as through the suspended 
coil, now the two magnetic fields will vary together in response to the AC 
current. The torque is proportional to the product of the fields just as 
before, which in this case is directly proportional to the square of the AC 
current. Therefore, the needle deflection is proportional to the average of 
the squared current, which is nonzero for AC. 
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As before, we will now calculate the currents / and /, for Figure 1.9. But 
what we'll find is that while Kirchoffs current law remains satisfied, i.e., 
I= h + h, it is possible under certain conditions for J and I to both be 
measured to be larger than I. 


The following mathematical expressions apply to the electonic circuit in 
Figure 1.9, making use of Kirchoffs laws and the definitions of resistors, 
capacitors and inductors. 


l=ht+h, 
dl 
V = LR + L— 
2 F dt ’ 
Vea 
Ied 
1 dt ’ 
V=],R+4+ Via. 


Differentiating the last of these equations gives 


dV dh 1 
a "mT" 
This, together with the relation 
dlh 
V=hR+ T’ 


gives us two differential equations, one for each of the currents J; and H. 
If we assume V is sinusoidal, then the use of complex exponentials makes 
the solution of these differential equations straightforward, and we will 
thereby solve Rayleigh’s puzzle. cryptography 


We first introduce complex exponentials by writing 
V(t) = 2V cos(wt) = Ve + Voe™, 


where we include a factor of 2 to avoid having to include a factor of 1/2 in 
all the equations that follow, in other words, we shall halve the results later. 


Since e” represents a unit vector in the complex plane making an angle 
6 with the positive real axis, e** is a unit vector making an angle wt that 
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increases with t. In other words, e** is a rotating vector that rotates 


counterclockwise at frequency w/27 Hz. Similarly, e~“”’ is a vector rotating 
clockwise at the same frequency. Counterclockwise rotations are con- 
sidered positive rotations, whereas clockwise rotations are taken to be 
negative. 


We shall next calculate J; and K for just the first term of V(t), i.e., for the 
Voe term, and call the results J7 and I7. Then we repeat the analysis for 
the second term, i.e., for the Ve“ term, and call the results Iy and J;. 
The solutions for V(t) = 2Vo cos(wt) will then be 


L=It4+Iz, 
Ip = If +. 


Since V* = Voe“* we have If = Ij ,e"" and I} = Iġ,e™*, where Jy, and Jj, are 
constants. Substituting V*, Jj + Sad I; into the two differential equations 
gives 


iwVoe™ = iwRIf e" + alee, 
We” = RIje aus ID 
or, after canceling the common e** terms, 
iwVo = iwRIg, + ales, 
Vo = Rigg + iwLIġ>. 


Solving, we find 


+ Vo 
DEEE 
wC 
It Vo 
of R+iwL 
If these steps are then repeated for V% = Voe”’ with I; = Jp ,e"“’ and 


I; = Ip,e', the results are 
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a Vo 
Iji = R EA 
+i 
Vo 
la = ——. 
02 R-—iwL 


Therefore, the currents J; and J, are, for V(t) = 2V cos(wt), 


f DRE Vo ; Vo a 
L = Ije + Igie wt — z — et T — e iwt 
a tize 
and 
b £ Ige + ioe" = Vo eivt Vo —iwt 
i : R+iwl R-wL 


The expressions for J, and I appear complex, but they’re actually purely 
real. This must be so for two reasons, one physical and one mathematical. 
The physical reason is that if we apply the real voltage V(t) = 2Vo cos(wt) to 
a circuit made of physical hardware, then all the voltages and currents 
(observables) must be real too. Mathematically, we note that both J, and 
I, are the sums of two terms which are complex conjugates. Such a sum 
is equivalent to twice the real part of either term. Now since the above 
expressions for J; and J, are the currents for V(t) = 2v cos(wt), the currents 
for V(t) = Vo cos(wt) are half as much, so we have 


We can find the real part of these expressions through an application of 
the identity for the inverse of a complex number which is 
1 1 a—bi a o 


arbi ath a=bi a+b PENT 
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Doing this, we have 


| R 1/wC 
R? + (1/wC)? R? + (1/wC) 


= paneer |z aeie a sn(ut) 


Vo iwt 
Be l R+ wi. \ 
wL 


= Re fv È ory A r!l [eostet + isin(ur)] | 


Vo A 
= P+ LP [R cos(wt) + wL sin(wt)]. 


So we have 


zil | cos(wt) + isin(wt) 
|l |) 


and 


Vo L. 
V 
h = EAE [R cos(wt) + wLsin(wt)]. 
In this form, it is not obvious what the maximum value of the currents 
would be. However, with the help of a trigonometric identity we can get the 
expressions for the currents into a form in which the maximum values are 


apparent, that identity being 
acos(x) + bsin(z) = va? + b? cos fa — tan! (2) \ 
a 


Through an application of this identity, the expressions for the currents 
become 


l = 2 cos(wt + tan™!(1/wRC)), 
(eo 

I, = wu cos(wt — tan™t(wL/R)) 
R2 + WL} i 
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where the relation tan~'(—x) = —tan7'(x) is used. It follows that J, and I, 
are sinusoidal currents with squared maximum values J>, and Iĝ, given by 
V2 
Ta 7 EC. 
R? + (zo) 
V2 
I 2 Sa ae 
2 R24 (wl)? 


The reason for squaring the magnitudes of the currents is that that corre- 
sponds to the quantities that the D’Arsonval current meter responds, the 
needle deflection being proportional to the average of the squared current. 


Now we consider the special frequency w = 1/VLC at which Jj, and Jj, 
become equal, i.e., 


At that frequency we also have 


Vo aıf1 /L 
T=h+h= cos {ut + tan ( 5) 
/R+5 BENG 
Vo 


+ — cos? wt — tan”! BAE 
/R+5 RVC 


An application of the trigonometric identities 


cos(a) cos(b) = 5 (cos(a + b) + cos(a — b)) 


and i 
cos(tan™t(x)) = ET 
yields 
I= ne cos(wt) cos {ia (zv z) | 
VR+é 
= ee l cos(wt) 
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Therefore the squared maximum value of the current J, denoted /?, is 


2 Avi 


I , 
© (R+) (1+ ze) 


Comparing that with the expressions 


V2 
Ty 151 0 27 ° L 

Re 
we see that I> < Iĝ; = 1f, when L/(R’C) > 3 and w = 1/VLC. Under such 
conditions meter M will measure a smaller current than either of the 
meters Mı or M2, even though the splitting of the current J forms currents 
I, and I>. probability theory a 


It took many years for the engineering community to feel comfortable with 
counterintuitive results such as this, which explains why AC circuits were 
long thought to be somehow different from DC circuits. All circuits obey 
the same physical laws, however, and an important milestone in under- 
standing the role of complex quantities was passed in 1893, when Charles 
Steinmetz presented a paper at the International Electrical Congress in 
Chicago titled "Complex Quantities and Their Use in Electrical Engineer- 
ing." His paper was introduced to the audience with the following words: 


We are coming more and more to use these complex 
quantities instead of using sines and cosines, and we 
find great advantage in their use for calculating all 
problems of alternating currents, and throughout the 
whole range of physics. Anything that is done in this 
line is of great advantage to science. 


To the extent that numbers exist, /—1 exists, though it does not represent 
a calculable quantity. Of the other common types of numbers, say, out of 
the set {Z, Q, R, C} of familiar number systems, that have representations 
suggestive of calculation are the rationals, Q; numbers such as 2/3. But 
they, in contrast, can be calculated, due to our acceptance of the real 
number system, R. Just as with the symbol "7", the only valid question is 
how do we calculate with the symbol \/—1, rather than how we calculate 
the symbol itself, as we could with a rational. 
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The symbol i is not "the number we would get if it were possible to take 
the square root of -1", because that is impossible, yet we have the number. 
Rather, by definition, i = /—1, and it behaves so. game theory 


To avoid confusion, it should be mentioned that —1 actually has two square 
roots, namely +i, and that it is possible to calculate the square root of a 
complex number, just not the square root of —1. In particular, through an 
application of Euler’s formula, it is easily shown that i itself does have a 
square root, in fact two square roots, namely, either of the two complex 
numbers 


+vi = e/t = + (cos “ + isin) == ($ + 2) 
4 4 2 2 

Mathematicians tend to think of a complex number a + bi, where a and b 
are real numbers and i = y—1, as a point in the complex plane, a plane 
in which the value of a corresponds to placement of the point along the 
horizontal x-axis and the value of b corresponds to placement of the point 
along the vertical y-axis. The z-axis is known as the real axis, while the 
y-axis is known as the imaginary axis, and are usually labelled Re and Im 
respectively. Unlike the real numbers, complex numbers do not have a 
natural ordering, so there is no complex-valued analog of real inequali- 
ties. This property is no longer confusing when they are viewed as being 
elements in the complex plane, since points in any plane lack a natural 
ordering. The term natural ordering refers to ordering along a line. Most 
points within any plane are not mutually collinear. 


Complex numbers possess many properties the reals do not possess, prop- 
erties such as phase. If we were to draw a vector from the origin to the 
complex number’s point in C, then what angle would the vector make with 
the positive z-axis? Now if we were to trace out a circle with that same 
vector, then all the complex numbers residing on it would have the same 
distance from the origin, known as the complex number’s absolute value 
or modulus. So there are an infinite number of complex values having 
the same "magnitude" in a sense, but each has a different phase. One 
must be careful with the concept of magnitude in C; magnitude does not 
correspond to order within C as it does within the real number system 
R. There is more to complex numbers than just raw magnitude. It can 
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be helpful to realize that a complex number’s phase also carries its own 
"magnitude". Complex numbers can be considered to be ordered in two 
ways — concentrically and rotationally, so they are described as having 
no natural order, in other words, like that of the natural numbers. To be 
precise, it would be incorrect to say that the complex numbers "have no 
order", for that would imply they are distributed randomly in common 
mathematical parlance, which is nonsense. 


Despite the initial awkwardness, the successes achieved through the use 
of complex numbers have exceeded all expectations. The successes helped 
gain their full recognition, with no small thanks due to the representation 
of complex numbers as points on a plane, which enabled mathematicians 
to visualize them in a natural way. desktop computers 


Chapter 2 


Del Ferro, Tartaglia, Cardano & 
Bombelli 


We begin with the work of a character named Scipione del Ferro (1465- 
1526) who taught at the University of Bologna. Del Ferro is most famous 
for discovering a formula for solving a depressed cubic — a third-degree 
equation that lacks a quadratic term, in other words, one of the form 
az? +cxz+d= 0. For presentation, we usually divide through the entire 
equation by a and move the constant term to the right-hand side, to get 
the form 


e+mz=n. 


At the time, del Ferro’s discovery of how to solve a depressed cubic for x 
was quite significant, and by today’s customs, we might expect him to have 
shouted his findings off the rooftops. But he kept the solution secret. Why? 
The political climate of Renaissance universities was perilous. Academics 
often had to patronize the politically powerful — da Vinci’s experience being 
a very famous example. But there is more. The concept of tenure hadn’t 
even been thought of yet, nor did society have safety nets in those days. On 
top of this, mathematicians had to be ready for "public challenges", which 
were basically intellectual duels. Participation in challenges were strongly 
encouraged by the university community, and these often became wild 
events even attracting the attention of large gambling rings. A loss often 
marked the end of one’s career. So, quite understandably, del Ferro felt 
that he was in possession of a secret weapon, and he treated his discovery 
so. Even if del Ferro could not solve his challenger’s problems, he was 
highly confident that his cubics would not be solved by his adversary. 
35 
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On his deathbed, del Ferro passed his discovery along to his student 
Antonio Fior. A brash and arrogant young lad, Fior decided to go on an 
intellectual spree with the new weapon, and eventually by 1535 became 
testy with one of the greatest minds in Europe, the famous scholar Niccolo 
Fontana (1499-1557), whom Fior mistakenly presumed to be a pretender 
to the throne. 


During the French attack on Fontana’s hometown of Brescia in 1512, a 
soldier slashed Fontana’s face with a sword, disfiguring him and making it 
impossible for him to speak normally. He became known by a nickname, 
Tartaglia, which means stammerer, and by this name is he best known 
even today. laptop computers 


Tartaglia was a profoundly-gifted mathematician. He had been advertising 
the fact that he could solve cubic equations missing only their linear terms, 
which was what attracted the attention and the doubts of Fior. Fior sent 
him a challenge consisting of thirty depressed cubics, to which Tartaglia 
wisely responded, by sending Fior a list of thirty problems covering various 
difficult mathematical topics. Tartaglia was either going to earn a perfect 
performance or completely fail, depending on whether or not he found the 
secret of solving cubics. 


Tartaglia went on a research binge. And with the deadline approaching, on 
February 13, 1535, he discovered the depressed cubic’s solution. He then 
quickly solved all of Fior’s problems, whereas Fior suffered a miserable 
performance. It was a great triumph for Tartaglia. Although the agreed- 
upon stake was to provide thirty banquets for the victor, Tartaglia decided 
to relieve Fior of the burden, who then faded out of the picture. 


One of the most eccentric mathematicians in history enters the story at 
this point, Gerolamo Cardano (1501-1576), of Milan. Cardano had heard 
of the challenge and desired to learn Tartaglia’s method for solving the 
cubic. He simply asked Tartaglia to divulge the secret, and from there the 
story takes a few twists. cell phones on sale 


There is just "that something" about the intriguing lives of eccentric ge- 
niuses that cannot be ignored, so we pause and examine some of Cardano’s 
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Figure 2.1: Portraits of the great mathematicians (from left to right) Niccolo 
Fontana Tartaglia, Gerolamo Cardano and Rafael Bombelli. 


curious life first. Lucky for us, Cardano wrote an autobiography The Book 
of My Life in 1575, the year before he died. The book recounts many of 
Cardano’s superstitions, peeves and personal anecdotes. Though some of 
the material is regarded with a degree of skepticism, the book nonetheless 
conveys Cardano’s frame of mind and turbulent life. 


Cardano’s family tree is as eventful as his own life, and may have included 
Pope Celestino IV. In the chapter titled "My Nativity", Cardano wrote "al- 
though various abortive medicines — as [he] heard — were tried in vain," he 
survived to be "literally torn from [his] mother’s womb." The young Cardano 
was then bathed in warm wine in an effort to bring him to life. Cardano 
perceived his birth as unwelcome, and may have been illegitimate. The 
resulting stigma he felt played a powerful role in his life. 


Cardano was plagued with chronic mental conditions his entire life, yet 
his intellect was supreme, as is evident in his works. He had an intense 
fear of heights, and of places where rabid dogs had been seen. He endured 
years of impotence, which, as fate would have it, lasted until just before he 
was married. Cardano also suffered from bouts of insomnia, often staying 
awake for more than a week at a time. diamond jewelry 
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If Cardano found himself on occasion to not be suffering, then he would 
inflict suffering upon himself. Cardano equated pleasure with the relief of 
pain. He felt that self-inflicted pain was a very desirable thing, because it 
felt good when the pain stopped. 


Cardano studied at the University of Padua to become a physician, but was 
refused permission to practice medicine in his home of Milan. This refusal 
was probably due to his bizarre personality, but whatever the justification 
was, this event marked one of the lowest points of his life. 


Cardano then moved to the rural town of Sacco, where he practiced 
medicine. One night, he dreamt of a beautiful woman and, being one 
who put a great deal of stock in dreams, after having met a woman named 
Lucia Bandarini who resembled the woman in his dream, he married her 
in 1531. 


Dreams and omens guided Cardano throughout his life. He was an as- 
trologer, wore amulets and claimed to predict the future from thunder- 
storms. He also felt the presence of a guardian angel, and would participate 
in detailed conversations and at times even argue with the divine attendant, 
out loud in front of his colleagues. That being the case, it is no wonder why 
many of his contemporaries felt that Cardano was not in his right mind. 
watches 


Cardano was a prolific gambler, he had a weakness for it, gambling nearly 
every day. His work Book on Games of Chance, published posthumously 
in 1663, was the first serious treatise on probability. 


From 1526 to 1532, Cardano spent busy years in Sacco, practicing 
medicine and raising his children, but eventually moved with his wife 
and kids back to Milan for reasons related to his ego, yet ironically, was 
still forbidden to practice medicine there, which left them poor. But Car- 
dano began giving popular science lectures that were well received by the 
educated. He began writing on many topics and in particular, in 1536 
wrote a scathing work attacking the practices of Italian physicians. Though 
the medical community detested this work, the public embraced it, and 
Cardano could not safely be prevented from practicing medicine any longer. 
He went to the top of his profession, even treating the Pope on occasion. 
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But then a series of tragedies befell Cardano. In 1546, his wife died at age 
31, leaving Cardano with two sons and one daughter. Yet much to Car- 
dano’s delight, his eldest son, Giambattista, seemed bright and destined 
to follow in his father’s footsteps into a brilliant medical career. But Gi- 
ambattista married an unfaithful woman who boasted about the town that 
none of her three children were his. In retaliation, Giambattista presented 
her with a cake laced with arsenic. Apparently, he was a good baker - 
Giambattista was arrested for murder, he confessed and was convicted, 
then beheaded by the Roman Empire during April 1560. 


Cardano became despondent, and lost his interest in life, followed by his 
friends and career. He then moved to the University of Bologna to teach 
medicine. In 1570, he got himself arrested and imprisoned on charges of 
heresy, for antics such as publishing his results of casting the horoscope 
of Christ and writing a book titled In Praise of Nero, the Roman Emperor 
that offered Christians to the Colosseum animals as food, sentencing them 
ad bestias, "to the beasts" — the beasts in question included dogs, bears, 
boars and lions. However, on the part of friend’s testimony and some 
leniency granted by the Church, Cardano was released from prison, went 
to Rome, and eventually was given a pension from Pope Gregory XIII (after 
first having been rejected by Pope Pius V). thermodynamics 


Cardano died in Rome on September 20, 1576, the day he had (supposedly) 
astrologically predicted earlier; but some suspect that he may have com- 
mitted suicide so that his own "prophesied" date of death would manifest 
true. 


Now let’s return to the story of the cubic equation. Cardano was interested 
in the techniques Tartaglia used to defeat Fior. Cardano had written to 
Tartaglia many times, pleading with him for the solution. Eventually, Car- 
dano brought Tartaglia to Milan as a guest. On March 25, 1539, Tartaglia 
revealed the secret of the depressed cubic — written in cipher - to Cardano, 
who took an oath of secrecy based upon his Christian faith which, if vio- 
lated, would essentially result in eternal damnation to the flames of Hell, 
according to the beliefs of those times. 


Another character then appears in this epic. Ludovico Ferrari (1522-1565), 
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approached Cardano looking for work. Cardano had that day imagined 
that the squawking of a magpie was a good omen and thus hired the young 
man as an assistant. But, as fate would have it, Ferrari amazingly turned 
out to be at least as profoundly gifted as Tartaglia, and even before Ferrari 
was 20 years old, Cardano considered him a colleague. Cardano eventually 
shared Tartaglia’s secret with his protégé, and they made great strides 
together. 


For example, Cardano discovered how to solve the general cubic equation 


r? +br? +cr+d=0 


where the coefficients b, c and d may or may not be zero. But Cardano’s 
work depended upon reducing the general cubic to a depressed form, thus 
publication would violate his pledge of secrecy to Tartaglia. Ferrari had 
meanwhile succeeded in solving the quartic (fourth degree) polynomial. 
Ferrari’s technique depended upon reducing the quartic to a related cubic, 
and again Cardano’s pledge forbade its publication. bioinformatics 


But in 1543, Cardano and Ferrari traveled to Bologna where they inspected 
the papers of del Ferro. There, in del Ferro’s own handwriting, was the 
technique for solving the depressed cubic. The fact that Tartaglia’s and 
del Ferro’s solutions were identical did not bother Cardano, del Ferro’s 
work having preceeded Tartaglia’s work. So in the year 1545, Cardano 
published Ars Magna, a title which reflects Cardano’s belief that algebra 
was "the great art". Ars Magna began with some introductory material, 
including standard solutions to linear and quadratic equations. But then 
it jumped into uncharted territory and laid out for the first time a complete 
procedure for solving cubic and biquadratic (third- and fourth-degree) 
algebraic equations. It was not until the arrival of mathematicians at the 
level of Francois Viéte (1540-1603) and René Descartes (1596-1650) that 
the book’s contributions were superseded. 


In Chapter XI, titles "On the Cube and First Power Equal to the Number," 
the preface reads: 
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Scipio Ferro of Bologna well-nigh thirty years ago dis- 
covered this rule and handed it on to Antonio Maria 
Fior of Venice, whose contest with Niccolo Tartaglia of 
Brescia gave Niccolo occasion to discover it. He gave it 
to me in response to my entreaties, though withhold- 
ing the demonstration. Armed with this assistance, I 
sought out its demonstration in many forms. This was 
very difficult. 


Cardano had merely given credit where credit was due, which appeased 
everyone except Tartaglia. He considered Cardano’s act treachery. In 
Tartaglia’s eyes, Cardano violated an oath pledged on his Christian faith, 
and was, therefore, a heathen. Cardano refused to respond to Tartaglia’s 
vitriolic letters, but Ferrari gave in to his legendary hot temper, and lashed 
back with demeaning threats, such as "If it were up to me to reward you, 
..., I would load you up so much with roots and radishes, that you would 
never eat anything else in your life", all pun intended. Eventually, a chal- 
lenge was exchanged between Tartaglia and Ferrari, in Milan in August 
of 1548. Ferrari won the battle and Tartaglia withdrew to return home, 
probably lucky to leave with his life. nuclear physics 


Such were the absurd set of events surrounding the solution of the cubic. 
Now we consider the great theorem at the heart of all this drama. 


In Chapter XI of Ars Magna, Cardano did not give a general proof, but 
rather a specific example of a depressed cubic, namely 


xv? + 6x = 20. 


But in our discussion, we shall paraphrase his work and treat the more 
general depressed cubic 
r? +mr =n. 


Before getting into the specifics of Cardano’s technique, a few points are 
worth mentioning. The material on cubic equations first appears in Chap- 
ter XI of the Ars Magna, which, as we stated previously, is titled "On the 
Cube and First Power Equal to the Number." From the perspective inherited 
from medieval mathematicians, there were considered to be 3 different 
depressed cubics and 13 different general cubics, rather than just one of 
each. This is because they only considered positive real numbers; they did 
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not admit complex numbers, nor did they consider nonpositive coefficients. 
The rule Tartaglia gave Cardano covered the three basic forms of the de- 
pressed cubic. In modern form, these would be: x? + br = c, x? = br +c 
and r? +c = br. Since mathematicians at the time did not use negative 
coefficients, a single, general form z? + br + c = 0 was precluded. 


In addition, our modern algebraic notation still lay in the future, and 
most of the mathematical statements were verbal. Cardano could capture 
generality in the rules for solving the pseudo-general cubic equations. In 
each chapter, then, Cardano first gives a geometrical demonstration of 
a specific numerical cubic equation, then a verbal rule for solving that 
general type of equation, then one or more sample problems and solutions 
using the rule. Because the use of zero and negative coefficients still lay in 
the future, Cardano is forced into spelling out 13 different cubic equations, 
all with positive coefficients, each with a separate chapter. 


Now, we can write a general cubic equation 
py +qy +ry+s=0 
(in which p # 0, or the equation is not a cubic) in the form 
3 2 = 
a? by +cy+d=0 


after dividing by the constant p. The case where d = 0 is trivial and not 
considered a true cubic. 


So, initially we get different cases as 


bŽ0, c=0, d2z0 
< < 


leading to 3-3-2 = 18 cases intially. Note that here we should interpret > 
and < in medieval terms, that is, indicators that the coefficient occurs on 
the left or right side of the equation, respectively. 


However, the cases 


=c¢c=0, d>0 
=c¢c=0, d<0 
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were not taken seriously as cubic equations (the first has no positve real 
solution and the second is trivial), while the cases 


b>0, c>0, d>0 
b=0, c>0, d>0 
b>0, c=0, d>0 


also have no real positive solutions. This leaves 18 — 5 = 13 cases to be 
considered. 


The key result of Chapter XI of Ars Magna is stated here in Cardano’s own 
manner, and his extremely clever partitioning of the cube is presented. 
The rule for solving cubics, as presented, is wordy and confusing, but 
the reader will come to a better understanding when it is recast in more 
familiar algebraic terms. 


As William Dunham puts it in his fascinating book Journey Through Ge- 
nius, "His argument was purely geometrical, involving literal cubes and 
their volumes. Actually, the surprise here is minimized when we recall 
the primitive state of algebraic symbolism and exalted position of Greek 
geometry among Renaissance mathematicians." 


Theorem 2.1. Rule to solve z? + mz =n. 

Cube one-third the coefficient of x; add to it the square of one-half the 
constant of the equation; and take the square root of the whole. You will 
duplicate this, and to one of the two you add one-half the number you have 
already squared and from the other you subtract one-half the same. Then, 
subtracting the cube root of the first from the cube root of the second, the 
remainder which is left is z. 


Proof. Cardano pictured a large cube, of side AC, whose length we denote 
by t, as shown in Figure 2.2. Side AC is divided at B into segment BC of 
length u and segment AB of length t — u. The large cube can be mentally 
subdivided into six pieces (use Figure 2.2 as a guide), and each piece is 
identified and assigned a volume as indicated: 
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Figure 2.2: Cardano’s partitioned cube. 


e A small cube in the lower front corner of volume u?. 
e A larger cube in the upper back corner of volume (t — u)’. 


e Two upright slabs, one facing front along AB and the other facing 
right along DE, each of volume tu(t — u). 


e A tall block in the upper front corner, standing upon the small cube, 
of volume u?(t — u). 


e A flat block in the lower back corner, beneath the larger cube, of 
volume u(t — u)?. 


Certainly the large cube’s volume, të, equals the sum of these six compo- 
nent volumes. That is, 


t = u’? + (t — u)? + Qtut — u) + wt — u) + ult — u)’. 
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A slight rearrangement of terms gives 
(¢ — u)? + [2tu(t — u) + u’ (t — u) + u(t — u)?] =P — u’ 
and factoring the common (t— u) term from the bracketed expression yields 
(t — u)? + (t — u)[2tu +u? + u(t —u)] = t — u’. 


Or simply, 
(t — u)? + 3tu(é — u) = BP — u’. (2.1) 


Now, Equation 2.1 resembles the cubic x? + mz = n. If we let t— u = x, then 
it becomes z? + 3tux = t? — u’, and this immediately suggests that we set 


Bu =n. 


3tu=m and 
If we can determine the values of t and u in terms of m and n from the 
original cubic, then x = t — u will yield the solution. To start, consider the 
two previous equations involving t and u, namely 


3tu=m and -u =n. 
From the first equation, we see that u = m/3t, and substituting this into 
the second equation gives 
3 
3 m 


-2768 
And now the key step - multiply both sides by t? and rearrange to get 


3 
m 
tê — n- — =0, 
Ha 
At first glance, this might seem to be a step backwards, trading the third- 
degree equation in x for a sixth-degree equation in t. But the later can be 
regarded as a quadratic equation in t’, 


3 
BAF o By on PS 
(E) — n(t?) 37 0. 
An application of the quadratic formula, which had been in use since 
Babylonian times, then yielded 
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eR mar 
2 
nil [2 | 4m? on, Jn? m? 
2 2 27 2 4 27 


Then, retaining only the positive square root, we arrive at 


TEE ae n? m? 
“VZV a" Z7 


And since u? = t? — n, we naturally conclude that 


So the algebraic statement of Cardano’s rule for solving the depressed 
cubic z? + mz =n is 


xrx=t—u 
oam n? m? 3f on n? m? 
SNIA Na Ira OF 


This formula is called an "algebraic solution" or a "solution by radicals" 
for the depressed cubic. Solutions of this type involve only the original 
coefficients of the depressed cubic - m and n — and algebraic operations, 
used only a finite number of times. It gives exactly the same result as 
Cardano’s verbally-stated rule. 


O 


Cardano’s approach in the proof of the previous theorem may leave one 
feeling bewildered. Though his generation was far more practiced at geo- 
metrical thinking than ours, how and why would he even imagine to relate 
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the initial 6-partitioning of a literal cube to this particular algebraic prob- 
lem? A statement made by the famous Polish-American mathematician 
Mark Kac (1914-1984) relates to this question — his famous distinction 
between the ordinary genius and the magician genius: "An ordinary genius 
is a fellow that you and I would be just as good as, if we were only many 
times better. There is no mystery as to how his mind works. Once we 
understand what he has done, we feel certain that we, too, could have 
done it. It is different with the magicians ... the working of their minds is 
for all intents and purposes incomprehensible. Even after we understand 
what they have done, the process by which they have done it is completely 
dark." Cardano’s proof is magical. Taking much for granted, his proof 
reads straightforward, but the initial idea, the partitioning of a literal cube, 
is not at all obviously related to the objective. 


But, from a modern perspective, the key insight in his derivation was 
to transform the cubic problem into a quadratic problem. This simple, 
yet brilliant step is suggestive of a method for solving equations of even 
higher degrees. However, note that in algebra, the Abel-Ruffini theorem 
(also known as Abel’s impossibility theorem) states that there is no general 
algebraic solution — that is, solution in radicals — to polynomial equations 
of degree five or higher. 


The content of the Abel-Ruffini theorem is frequently misunderstood. The 
theorem only concerns the form that such a solution must take. It does 
not assert that higher-degree polynomial equations are unsolvable. In fact, 
the opposite is true: every non-constant polynomial equation in one un- 
known, with real or complex coefficients, has at least one complex number 
as solution; this is the fundamental theorem of algebra, which can also 
be stated equivalently as: every non-zero single-variable polynomial with 
complex coefficients has exactly as many complex roots as its degree, if 
each root is counted up to its multiplicity. The reader should note that in 
these definitions, the complex numbers C are considered to include the 
real numbers R as a subset, so by the phrase "complex number" we do not 
necessarily mean "purely imaginary number", which is a complex number 
with real part precisely equal to zero, nor do we necessarily exclude the 
pure reals. 
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Although the solutions cannot always be expressed exactly with radicals, 
they can be computed to any desired degree of accuracy using numerical 
methods such as the Newton-Raphson method or Laguerre method, and 
in this way they are no different from solutions to second, third or fourth 
degree polynomials. 


Simply plugging appropriate values into the algebraic equation allows us 
to easily find a solution for Cardano’s example x? + 6x = 20, 


= ¥/104 Vi08 — {/—10 + Vi08 


which is obviously a "solution by radicals" as the saying goes. 


Surprisingly, this complicated expression is equivalent to the number 2, as 
one may check through calculation. Also, as the reader may easily check, 
x = 2 is indeed a solution of z? + 62 = 20. 


Now, having found one solution to the cubic, we are actually in a position 
to find any others. Since x = 2 solves the above equation, we know that 
x — 2 is a factor of x? + 6x — 20. Long division will generate the other. In 
this case, we have x? + 6x — 20 = (x — 2)(x” + 2x + 10). The solutions to the 
original cubic can be obtained by solving 


x-2=0 and 274+27r+10=0. 


The quadratic equation above has no real solutions, so the cubic has x = 2 
as its only real solution. 


What about the general cubic of the form az? + bz? + cr + d = 0? Cardano 
could solve the depressed cubic, but ultimately, despite Cardano’s brilliant 
restatement of the method, it was del Ferro’s discovery how to do so. 
Cardano himself made the great discovery that, by means of an appropriate 
substitution, the general equation could be replaced by a related depressed 
cubic that was susceptible to solution. Overall, the process looks like this: 


1. Depress the cubic: Achieved by substituting into the original cubic an 
appropriate substitution formula in a new variable, say y. (Cardano) 


2. Solve the depressed cubic: Find y in terms of the depressed cubic’s 
variables, say m and n. (del Ferro) 
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3. Recover the value of the original general cubic’s variable, say x, in 
terms of y, using the substitution formula. (Cardano) 


Before we demonstrate the depressing process for the cubic, it is instructive 


to see the effect of applying it to quadratic equations. Suppose we have a 
general second-degree equation 


ax? +ba+c=0 where a¥0. 


To depress the quadratic — that is, to eliminate ita first-power term — we 
substitute x = y — b/2a, which leads to 


b? b? 
2 
by + b =0. 
2 ae e ae 
After cancellation, then by rearranging, we arrive at the depressed quadratic 
2 Ber b E b? — 4ac 
R d S ge 
We have 
2 b — 4ac _ tv? — 4ac 
y? = ——.— and y ; 
4a? 2a 


And as a final step we calculate, 


b  +tVb?—4dac b _ -bvb — 4ac 
2a 2a 2a 2a 
which is the familiar quadratic formula. 


r=y 


So, we see that depressing polynomials can be rather useful. Now let’s 
attack the general cubic. In this case, the appropriate substitution turns 
out to be x = y — b/3a, giving us 


b \’ b \? b 
u(y =) -a(x =) re(y Z) +4=0. 
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Simply expand this to get 
b? be 2b? b’ 
3 2x Aj — | Dira ae 
(o oy 3a” aa | (w 3a” ig z) 


b 
+(a- $2) ta=0 


All we have to do here is make a single key observation: the y? terms will 
cancel out. The new depressed cubic will lack a second-degree term as 
desired. Now, if we divide through by a, the resulting equation will of 
course take the form y? + py = q. We solve this as before, and from there 
easily determine z = y — b/3a. 


As an example, consider the cubic 
2x° — 30x? + 162x — 350 = 0. 
Using the substitution x = y — b/3a = y — (—30/6) = y +5, we have 
2(y +5)? — 30(y + 5)? + 162(y + 5) — 350 = 0, 
which when simplified becomes 
2y? + 12y — 40 =0 orsimply y* + 6y = 20. 


But this is the depressed cubic solved earlier, so we know y = 2. So we find 
x =y+5=7, and this checks in the original cubic equation. 


This seems to have brought matters to a conclusion. The cubic had been 
solved. But wait! Cardano’s formula introduced something mysterious. 


Consider the depressed cubic x? — 15x = 4, for example. Inserting the values 
m = —15 and n = 4 into Cardano’s formula, we get 


= /2+V=121 VEEN. 


Mathematicians were hardly comfortable with negative numbers in the 
sixteenth century. This had much to do with the strong emphasis on geo- 
metric thinking — there is no such thing as a negative amount of distance 
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in classical geometry. So the square roots of negative numbers seemed 
absolutely preposterous to them. Thus, it was natural for them to dismiss 
an example such as this as an "unsolvable cubic". However, the cubic 
above has three different and purely real solutions: x = 4 and x = —2 + V3. 


What were mathematicians to make of these cases? Though he did spend 
some time on the subject, Cardano ultimately dismissed the business of 
complex numbers as being "as subtle as it is useless." 


It would be many years before Rafael Bombelli (1526-1573), in his 1572 
treatise Algebra, asserted the necessity of complex numbers as a vehicle 
to transport the mathematician from the real cubic to its real solutions. In 
certain cases, though we begin and end in the familiar terrain of real num- 
bers, complex numbers are absolutely necessary to complete the journey. 


Let’s examine briefly Bombelli’s work. We cube the complex number 2+ ./—1 
to get 


(24+ /—1)? =8412V/-1-6-V-1 
=24+11/-1=24+ V-121. 


But if (2+ /—1)? = 2 + /—121, then it certainly follows that 


4) 2 af 191 = 24 4/—1. (2.2) 


In a similar way, we can also see that 


\/ -2 + V—121 = —2 + V-1. 


Concerning the cubic x? — 15x = 4, Bombelli arrived at 


a= 4/24 V1 E ee or 
=(2+V7-1)-(-2+v-)=4 


which is one of the correct answers. 


Bombelli’s observation raises more questions than it answers. How does 
one know the truth of Equation 2.2 beforehand? Bombelli was essentially 
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walking "backwards" into the problem. The world would have to wait until 
the middle of the eighteenth century for Leonhard Euler to discover a 
sure-fire technique for finding roots of complex numbers (in a "forward" 
direction). And what exactly are these complex numbers, and how were 
they different than real numbers? The full scope of the importance of 
complex numbers was not realized until the work of Euler, Gauss and 
Cauchy, over two centuries after Bombelli’s work. 


The work of del Ferro and Cardano created a need for the recognition of 
complex numbers, whereas Bombelli was the first to actually give that 
recognition. Bombelli was the first to teach the art of correct formal com- 
putation with complex numbers. 


Now, a few things need explicit mentioning here. Contrary to popular belief, 
or what may seem to be implied in some textbooks, complex numbers did 
not enter the realm of mathematics out of a need to solve quadratics. Com- 
plex numbers entered as a tool for solving cubics. Mathematicians could 
more easily dismiss /—121 when it appeared as a solution to z? + 121 = 0, 
which has no real solutions. But they could not so easily dismiss /—121 
when it played such a key role in arriving at the solution x = 4 for the 
previous cubic. It was the relation of complex numbers to cubics, not 
quadratics, that made complex numbers interesting, which eventually led 
to their legitimization. 


Nevertheless, the fact that /—1 is a solution to z? + 1 = 0 does give a per- 
fectly valid illustration of the utility of complex numbers, and the mythology 
concerning the role of the quadratics might be preferable in some situa- 
tions, in terms of expedience. In the rest of this work, we usually adhere 
to this mythology, for that very reason. 


Also note the following nuance in nomenclature. Cardano’s name is usu- 
ally spelled Cardan in the English language. We shall forgive the minor 
alteration and use the name Cardan in the rest of this book, for no other 
reason but consistency with the greater part of the mathematical literature. 


Chapter 3 


The Complex Field 


In the 16th century, Cardan reluctantly worked with complex numbers in 
solving quadratic and cubic equations. Later in the 18th century, functions 
involving /—1 were found by Euler that were solutions to differential 
equations. In a letter dated October 18, 1740 to John Bernoulli, Euler 
wrote that the solution to the differential equation 
2 
Hays, y(0) = 2, y/(0) =0 


can be written in two ways: 


y(x) = 2 cose 
ylz) = etv-1 Ea e721, 


It became more and more apparent over time, as manipulations involving 
complex numbers became more common, that many problems in real 
mathematics could be more easily solved using complex mathematics. 
Complex numbers gained wider acceptance after Gauss developed the geo- 
metric representation of complex numbers, presented in April of 1831 ina 
memoir to the Royal Society of Göttingen. Gauss realized that capturing 
the intuitive meaning of complex numbers was sufficient to admit the 
numbers into the practice of mathematics. But the first formal definition 
was given by William Hamilton in a paper presented in June of 1835 to the 
Irish Academy titled "Theory of Conjugate Functions or Algebraic Couples: 
with a Preliminary Essay on Algebra as a Science of Pure Time." The paper 
was partly metaphysical, which we skip over, but with the mathematics 
contained therein we begin and consider the field of complex numbers. 
53 
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3.1 The Complex Field 


A group is an algebraic structure consisting of a set together with an 
operation that combines any two of its elements to form a third element. 
To qualify as a group, the set and the operation must satisfy a few con- 
ditions called group axioms, namely closure, associativity, identity and 
invertibility. An abelian group, also called a commutative group, is a group 
in which the result of applying the group operation to two group elements 
does not depend on their order. 


A ring is an algebraic structure consisting of a set together with two binary 
operations usually called addition and multiplication, where the set is an 
abelian group under addition. A commutative ring is a ring in which the 
multiplication operation is commutative. 


A field is a commutative ring whose nonzero elements form a group under 
multiplication. The complex numbers form a field, the complex field, which 
is denoted by the symbol C. 


As we shall soon see, complex numbers can be written in the form a + bi, 
where a,b € R andi = /—1. But such a statement is not a formal definition, 
for it assumes that \/—1 makes sense in any system. The existence of a 
system in which a quantity such as \/—1 does make sense is what needs 
to be established. Furthermore, the operations of "addition" and "multipli- 
cation" that appear in the expression a + bi have not been defined in terms 
of such a system. Hamilton’s formal definition that follows defines these 
operations in terms of ordered pairs. 


The complex field C is the set of ordered pairs or real numbers (a,b) with 
addition and multiplication defined by 


(a,b) + (c,d) = (a+c,b4+d) 


(a, b)(c, d) = (ac — bd, ad + bc). 


The associative and commutative laws for addition and multiplication as 
well as the distributive law are a direct consequence of the very same 
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properties for the real numbers. The additive identity, what we shall call 
"zero", is (0,0) and the additive inverse of (a,b) is (—a, —b). The multiplica- 
tive identity is (1,0). 


Now to find the multiplicative inverse of any nonzero (a, b): 


Set 
(a, b)(z, y) = (1,9), 
Then 
ax — by =1 
bx + ay = 0. 


As the reader can easily verify, the system of equations has solutions 


a —b 
rt = ——— zen 
qe” y a2 4 b2 


The complex numbers thus form a field. 


The subset of C having the form (a,0) is isomorphic to R, meaning all 
operations are preserved. (0,1) is y—1 since 


(0, 1)(0,1) = (—1,0) = —1 and we denote i = (0,1) = V—1. 


Hamilton’s formality is important for understanding why complex numbers 
are written the way they are; they are written in the form a + bi because 
any complex number (a,b) can be written as 


(a,b) = (a,0) + (0,b) = (a,0) + (b,0)(0,1) = a + bi. 


Consider a complex number z = (x,y). x is called the real part and y the 
imaginary part of z, written 


x= Rez, y=Imz. 


By definition, two complex numbers are equal if and only if their real 
parts are equal and their imaginary parts are equal. i = (0,1) is called the 
imaginary unit. If x = 0, then z = iy and is called pure imaginary. 
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The notation z = x + iy for complex numbers is exclusively used in practice. 
For addition it gives 
(x1 + iy1) + (£2 + iy2) = (x1 + x2) + iyı + y2). 


For multiplication it gives the following very simple recipe. Multiply each 
term by each term and use i? = —1 when it occurs: 


(x1 + iy1)(£2 + iy2) = £1£2 + i£1Y2 + tye + YY 
= (x£1£2 — Y1Y2) + il£1Y2 + Loy). 


Example 3.1. Let z; = 8 + 3i and z3 = 9 — 2i. Then Rez, = 8, Imz, = 3, 
Re z2 = 9, Im z2 = —2 and 
zı + 29 = (8 + 3i) + (9 — 2i) = 17 +i 
212 = (8 + 3i) (9 — 2i) = (72 + 6) + i(—16 + 27) = 784 1li 


Subtraction and division are defined as the inverse operations of addition 
and multiplication, respectively. Thus the difference z = zı — z is the 
complex number z for which 21 = z + z2. Hence, 


Zi — Z2 = (xy = T2) + ily ga Yə). (3.2) 


The quotient z = 21/22 (z2 # 0) is the complex number z for which z, = 
zzə2. The practical way to get the rule is by multiplying numerator and 
denominator of z1/z2 by z2 — iy. and simplifying: 


ry +4 vt +4 v2—-1 L1xX_Q + x =T 
paie Ee. 1+ iyı) (x2 — iy2) —_ 7172 T Y1942 jai 291 192 (3.3) 


£2 +iy2 (2 + iy2)(x2 — iy2) r3 +y r3 + y3 


Example 3.2. For z1 = 8 + 3i and z = 9 — 2i we get 


zı — % = (8 + 3i) — (9 — 2i) = —1 + 5i 
z 8+3i (8+3i)(9+2i) 66+43i 66 43 


z 9-2 (9—-2)\(94+2)) 8144 85 | 85 
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Properties of addition and multiplication are the same as for real numbers, 
from which they follow (here, —z = —x — iy): 


Commutative laws: 

21 +F 22 = 22 + 21 

Z122 = 2221 
Associative laws: 
(241 + 22) + 23 = 21 + (z2 + z3) 

(z122)23 = 21 (2223) 

Distributive law: 
zı(z2 + 23) = 2122 + 2123 


We also have: 


3.2 The Complex Plane 


This was algebra. Now comes geometry: the geometrical representation of 
complex numbers as points in the plane. This is of great practical impor- 
tance. The idea is quite simple and natural. We choose two perpendicular 
coordinate axes, the horizontal x-axis, called the real axis, and the vertical 
y-axis, called the imaginary axis. On both axes we choose the same unit of 
length. 


This is called a Cartesian coordinate system. We now plot a given complex 
number z = (x,y) = x +iy as the point P with coordinates x,y. The ry-plane 
in which the complex numbers are represented in this way is called the 
complex plane. The complex plane is sometimes called the Argand dia- 
gram, after the French mathematician Jean Robert Argand (1768-1822), 
born in Geneva and later librarian in Paris. His paper on the complex 
plane appeared in 1806, nine years after a similar memoir by the Norwe- 
gian mathematician Casper Wessel (1745-1818), a surveyor of the Danish 
Academy of Science. 
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the complex plane 


Instead of saying "the point represented by z in the complex plane" we say 
briefly and simply "the point z in the complex plane." This will cause no 
misunderstandings. Addition and subtraction can now be visualized as 
illustrated in the following figures. 


Figure 3.3: Addition of Figure 3.4: Subtraction 
complex numbers of complex numbers 


3.3 Complex Conjugate Numbers 


The complex conjugate Zz of a complex number 


z = x + iy is defined by 7 = x — iy 
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and is obtained geometrically by reflecting the point z across the real axis, 
as shown in the next figure. 


Figure 3.5: Complex conjugate numbers 


Conjugates are useful since zZ = z? + y? is real. Moreover, addition and 
subtraction yield z +Z = 2x and z — Z = 2iy, so that we can express the real 
part and the imaginary part of z by the important formulas 


If z is real, z = x and z = z and conversely. 
Working with conjugates is easy, since we have 


(a +2) = 2+2, (a= 22) =A—-%RB 


ea zı Zy 
(2122) = 21 29, = 
22 22 


Example 3.3. Let z = 4 + 3i and z = 2 + 5i. Then 


; ; 3t + 32 

Imz = a (4 + 3) — (4 — 3i)| = P 3 
(iz) = (4+ 3i)(2 + 5i) = —7 + 201 = —7 — 26i 
zZ Z = (4 — 3i)(2 — 5i) = -7 — 26i 
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Example 3.4. (Powers of the imaginary unit) 

i? = —1 since i? = (0,1)(0,1) = (—1,0) = —1 by the definition of complex 
multiplication. Therefore, 

i = i, f = 1, Ë =i, ... 


From these it follows that 


Example 3.5. Let z, = 4 — 5i and z = 2 + 3i. Find in the form z + iy: 


a) (z1 + 22)? = (6 — 2i)? = 32 — 24i 


b) 2 +39 (445i) _ T2, 
a @-5) (445) 41° 41 


Example 3.6. Find 


1 i Ñi 1-i ie i ae 
rag = Re | ay Sy] = Rett = Re (5 - 5!) = 5 
bon ai SPR A ae W ic) _ 3l 

Ta Tai T 5050) 50 


ce) Im 2 =Im= ge ee 
z 


a) Re 


1 z? — y? +2ryi 2x£Y 
xr—iy r+iy = x? +y? gt ty? 


d) (1 + i)? = (1 + i)?t = (2%)4 = 16 
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Example 3.7. If the product of two complex numbers is zero, show that at 
least one factor must be zero. 


Proof. From the definition of complex multiplication we have, 


L1X2— Y1Y2 = 0 
L1Y2 + Loy, = 0 


Assume (22, y2) 4 0. Then (z1,y1) = (0,0) because: 
Solving for x, 


_ Y1Y¥2 — a t2Y1 
Ly SS vy = $ 
T2 Y2 


Substituting the first in the second equation, 


Y1y2 
—— y2 = -T241 
T2 

Yy? = —yx3 


Now if y, 4 0 then —2x} = y2 which is impossible. 


e UNES 0. 


Since yı = 0, 


Tiza — Yiy2 = 0 => 2172 =0 


and 


T1Y2 + T2Y1 = 0 = T1Y2 = 0 


But we assumed (2, y2) 4 0, 


kt = 0. 
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3.4 Polar Form of Complex Numbers. 


We can substantially increase the usefulness of the complex plane and 
gain further insight into the nature of complex numbers if in addition to 
the xy-coordinates we also employ the usual polar coordinates r,@ defined 
by 


xz = r cos, y =rsinð. 


Then z = z + iy takes the so-called polar form 


z = r(cos0 + isin 0) = re”. 


r is called the absolute value or modulus of z and is denoted by |z|. Hence 


jz) =r = yr? +y? = VZ. 


Geometrically, |z| is the distance of the point z from the origin. Similarly, 
|Z, — z2| is the distance between z, and 22. 


Imaginary 
axis 


| 
| 
| 
| l Real 
x axis 
Figure 3.7: Distance be- 
tween two points in the 
complex plane 


Figure 3.6: Complex 
plane, polar form of a 
complex number 


0 is called the argument of z and is denoted arg z. Thus, 


0 = arg z = arctan 2 (z #0). 
£ 


Geometrically, 0 is the directed angle from the positive x-axis to OP. Here, 
as in calculus, all angles are measured in radians and positive in the 
counterclockwise sense. 
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For z = 0 this angle 0 is undefined. For given z 4 0 it is determined only up 
to integer multiples of 27. The value of 0 that lies in the interval —r < 0 < m 
is called the principal value of the argument of z (4 0) and is denoted Arg z. 
Thus 0 = Arg z satisfies by definition 


=T < Årgz <T. 


Yet in many instances, it is proper to define a multivalued argument 
function, 


arg z = Arg z + 2mn = 0 + 2rn, a end e 


The multivalued argument function will become especially useful when we 
study the properties of the complex logarithm and complex power func- 
tions. 


It is useful to have a straightforward, explicit formula for Arg z in terms of 
arg z; this is not simply a matter of subtracting 27n from each side however, 
for that would make Arg z multivalued. First we introduce some notation: 
|x| designates the greatest integer function applied to x; it means the 
largest integer less than or equal to the real number z. That is to say, |x| 
is the unique integer that satisfies the inequality 


x —1< |x] <a for real x and integer |z]. 


For example, |1.7| = 1 and |—0.4] = —1. With this notation, one can write 
Arg z in terms of arg z as 


1 = 


A = H2 
rg z= argz r|; On 


It is straightforward to check that Arg z does indeed fall within the principal 
interval —7 < 0 < 7. 


One should not plainly identify Arg z with arctan(y/x). The real function 
arctan x is multivalued for real values of x. It is conventional to introduce 
a single-valued real arctangent function, called the principal value of the 
arctangent, which is denoted by Arctanz and satisfies —% < Arctanz < 


5. Since -r < Argz < 7 it follows that Argz cannot be identified with 


Arctan(y/z) in all regions of the complex plane. The correct relation between 
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these quantities is ascertained by considering the four quadrants of the 
complex plane. The quadrants of the complex plane are called regions I 
(upper right), II (upper left), IN (lower left), and IV (lower right). 


Table 3.1: Formulae for the argument of a 
complex number z = x + iy. 


Quadrant | Sign of x and y Arg z 
I z>0,y>0 Arctan(y/z) 
II x<0,y>0 T + Arctan(y/x) 
II z<0,y<0 —r + Arctan(y/z) 
IV oS Oy <0 Arctan(y/z) 


The principal value of the argument of z = x + iy in terms of its real part x 
and imaginary part y is given in the above table, assuming z lies within one 
of the four quadrants of the complex plane. Note that Arg z = Arctan(y/z) 
is valid only in quadrants I and IV. If z lies within quadrants II or III, one 
must add or subtract 7 to ensure that 5 < Argz < 7 or —7 < Argz < -5 
respectively. 


Table 3.2: Formulae for the argument of z = x + iy when z is 
real or pure imaginary. 


Quadrant Border type of z x and y conditions Arg z 
IV/I positive real z>0,y=0 (0) 
I/II pure imaginary, Imz > 0 xr=0,y>0 7 
H/M negative real x<0,y=0 T 
IM/IV pure imaginary, Imz < 0 g= hyi ao 
origin Zero r=y=0 undefined 


For finite nonzero values of y/x, the principal values of the arctangent 
function lie inside the interval 0 < Arctan(y/x) < 7/2 if y/x > 0 and inside 
the interval —r/2 < Arctan(y/x) < 0 if y/x < 0. 
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Note that 
0, if y = 0 and x 40, 
Arctan(y/z) my if z = 0 and y <0, 


undefined, ifr=y=0. 


We can view a multivalued function f(z) evaluated at z as a set of values, 
each element of the set corresponding to a different choice of some integer 
n. For example, given the multivalued function arg z whose principal value 
is 0 = Arg z, then arg z consists of the set of values: 


argz = {0,0 + 27,0 — 27,0 + 4r, 0 — 47, ...}. 


Given two multivalued functions, for example f(z;0 + 2rn) and g(z;0 + 27n), 
where f(z;0) and g(z;@) are the principal values of f(z) and g(z) respectively, 
then f(z) = g(z) if and only if for each point z, the corresponding set of 
values of f(z) and g(z) precisely coincide: 


{7 (2); f(z; 2r), f(z; —2r), a ai 
One may refer to the equation f(z) = g(z) as a set equality since all the 
elements of the two sets in eqn. 3.5 must coincide (here the ordering of the 


terms is not important and only distinct, rather than duplicate elements 
matter, which may be deleted). 


(3.5) 


To understand how the set equality of two multivalued functions works, 
consider the multivalued function arg z. It is provable that 


arg(z122) = arg 2, + arg 22 (3.6) 
Zi 

arg (2) = arg zı — arg 22 (3.7) 
22 
1 = 

arg G) = arg Z = — arg z (3.8) 


To prove eqn. 3.6, consider z = |z\e’** and z = |z9|e’*"8*. The arguments 
of these two complex numbers are: argz; = Arg 2, + 2mnı and arg 2. = 
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Arg z2 + 27n2, where nı and ns are arbitr: integers. One can also write 
g 
z = |zle’A™ and z = |z since e?" = 1 for any integer n. It then 
follows that 

22 = |2122 |etArs z1+Arg z2) 


where we used |z;||22| = |2122|. So we have established that 


arg 21 + arg z2 = Arg z1 + Arg z2 + 27 (n1 + n2) 


arg(z1Z2) = Arg z1 + Arg z2 + 2712 


where n1, n2 and nız are arbitrary integers. Thus, arg zı + arg z2 and arg(z1 22) 
coincide as sets, and so eqn. 3.6 is confirmed. One can prove eqns. 3.7 
and 3.8 similarly. In particular, if one writes z = |z|e’"®* and employs the 
definition of the complex conjugate (which yields |Z| = |z| and Z = |z/e~’*"8*), 
then it follows that arg (+) = argz = — arg z. As an example, consider the 
last relation in the case of z = —1. It then follows that 


arg(—1) = —arg(—1) 
as a set equality. This is not paradoxical since the sets 
arg(—1) = {+7,+37,...} and — arg(—1) = {+7, +37,...} 


coincide, as they possess the same list elements. 
For clarity, consider the surprising fact 


arg z? A Qarg z. 
To see why this inequality is surprising, consider the following false proof. 
Use eqn. 3.6 with z, = z2 = z to derive: 
arg z? = arg z + arg z 29 arg z (false) 


The false step is the one indicated by the symbol = above. Given z = glee, 
one finds that 2? = |z|?e"(?4t87+2"™) — |z|2eiArsz, and so the possible values of 
arg(z”) are: 


arg(z”) = {2Argz,2Argz+2n,2Argz+4n,...}, 
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whereas the possible values of 2 arg z are: 
2argz = {2 Arg z, 2(Arg z + 27), 2(Arg z + 4z),...} 
= {2 Arg z, 2 Arg z + 4r, 2 Arg z+ 87,...} 


Therefore, 2arg z is a subset of arg(z”), and half the elements of arg(z*) are 
missing from 2 arg z. These are therefore unequal sets. 


Here is one more example of an incorrect proof. Consider eqn. 3.7 with 
z1 = 2 = z. Then one may be tempted to write: 


Z ? 
arg (=) = arg(1) = arg z — arg z = 0. 
z 


This is clearly wrong since arg(1) = 27n, where n is the set of integers. The 
fallacy of the questionable statement is the same as above. When you 
subtract arg z as a set from itself, the element chosen from the first set 
need not be the same as the element chosen from the second set. 


The properties of the principal value Argz are not as simple as those 
given in eqns. 3.6-3.8, since the range of Arg z is restricted to lie within 
the principal range —7 < Argz < m. Instead, the following relations are 
satisfied: 


Arg(z122) = Arg z1 + Arg z2 + 2n N} (3.9) 


Arg(z1/z2) = Arg z1 — Arg z2 + 2r N_ (3.10) 


where the integers N4 are determined as follows: 


—l, if Arg z, + Arg z2 > 7, 


Na =< 0, if — r < Arg z1 + Arg z < T, 
1, if Arg z1 + Arg 22 < —7. 
If we set z = 1 in eqn. 3.10, we find that 
1) _. __ f Argz, if Imz = 0 and z £0, 
Ag C) = Arg? = { —Argz, if Imz #40. 


Note that for z real, both 1/z and Z are also real so that in this case z = Z 
and Arg(1/z) = Argz = Arg z. 
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If n is an integer, then 


arg z” = arg z + arg z + --- +argz Anargz, 


where the final inequality was noted in the case of n = 2 earlier. But the 
corresponding property of Arg z is much simpler: 


Arg(z”) = n Arg z + 2r Nn, 


where the integer N, is given by 


N= lt eas 
2 2r 


where |-| is the greatest integer function introduced earlier. 


Example 3.8. Polar form of complex numbers. Principal value. 


Letz=1+i. 


Then 


z= v2 (cos? +isin7), |z| = v2, 


arg 2 = T + 2m7, E E EEE 


T 
A = — 
rg z 7i 
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Example 3.9. Polar form of complex numbers. Principal value. 


Let z = 3 + 3V3i. 


0 = Arctan 3v3 = Arctan(V3) = Arctan v3/2 =E 
3 1/2 3 
r = 1/3? + (3V3)? = V9 +27 = V36 = 6 
z = e's 
T 
Arez = 
rg z 3 
E 
Example 3.10. Polar form of complex numbers. Principal value. 
Let z = —3 + 3V3i. 
0 = Arctan v3 HT = R 
—3 3 3 
-27 
z = 6e 3 
27 
Are z = — 
rg z 3 
a 


Triangle inequality. For any complex numbers we have the important 
triangle inequality 


lza + 20] < [zal + lz2l (3.11) 


which we will use quite frequently. This inequality follows by noting that 
the three points 0, z, and z1 + 22 are the vertices of a triangle with sides 
|z1|, |z2| and |z, + z2|, and one side cannot exceed the sum of the other two 
sides. 
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Figure 3.8: Triangle inequality 


Proof. Eqn. 3.11 holds when zı + z2 = 0. Let z1 + 240 and c = a + ib = 
21 / (21 + 22). 


Note that | Re z| < |z| and | Im z| < |z| so we have 


la| < |c], |a—1] < |e- 1l. 
Thus |a| + |a — 1| < |e] + ļe — 1|. 


Clearly |a| + |a — 1| > 1. 


Taken together we have the inequality 


zy 
zı + 22 


Z2 


1< |e] +ļe— 1| = — 


Multiply by |z: + z2| to get 


[21 + Z| < [zal + lz2l. 


Example 3.11. If z; = 1 +i and z, = —2 + 3i, then 


lza + 22| = | — 1 + 42] = V17 = 4.123 < V2 + v13 = 5.020. 
E 


Generalized triangle inequality. By induction we obtain from eqn. 3.11 for 
any sum 


[z1 +22 +t + Zal < leal + [zl +: lnl; 
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that is, the absolute value of a sum cannot exceed the sum of the absolute 
values of the terms. 


3.5 Multiplication and Division in Polar Form. Powers 


This will give us a better understanding of multiplication and division. Let 


zı = 11(cos 6, + isin 01) and Z2 = T2(cos J + isin 03). 


By the definition of complex multiplication, the product is at first 


2122 = rır2|(cos 0; cos 6) — sin 9; sin 62) + i(sin 01 cos 62 + cos 6; sin 02)]. 


Application of trigonometric identities yields 


Taking absolute values and arguments on both sides, we thus obtain the 
important rules 
|z122| = |1|]22| 


and 


arg(z122) = arg 2, + arg z2 (up to multiples of 27) 


The quotient z = z/z2 is the number z satisfying zz. = z1. Hence |zz2| = 
|z||zo| = |z1| and arg(zz2) = arg z + arg z2 = arg z1. This yields 


Z1 zl 
ee zy £0 
29 A ( 2 # ) 
and 
arg Ae arg zı — arg z2 (up to multiples of 27). 


22 
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By combining these formulas we have 


[cos(, — 92) + isin(O, — 02)] = e0103) 
12 


Hence 


|z122| = 6V2 = |z1||z2| = 3V8, 
2v3 la 
3 |22 


and for the arguments we obtain Arg z1 = 37/4, Arg z3 = 7/2, 


zy 


22 


3 
Arg z122 = R = Arg zı + Arg z2 — 27, 
Arg Te a Arg zı — Arg 22. 
Z2 4 
Integer powers of z. From the formula for polar multiplication we have 
z2? = r*(cos 26 + isin 20) = re”. 


Inverting this, we have 


z7? = r° [cos(—20) + isin(—20)] = re", 


and more generally, for any integer n, 


n ind 


z” = r” (cosnð + isin nð) = r”e 
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Example 3.13. Formula of De Moivre. For |z| = r = 1, eqn. 3.14 yields the 
so-called formula of De Moivre 


(cos0 + isin 0)” = cosné + isin nð. 
This formula is useful for expressing cos n and sin nf in terms of cos and 
sin. For instance, if n = 2, 
(cos @ + isin 0)? = cos? @ — sin? 0 + 2 cos @ sin 61 
= cos 20 + i sin 20 


We get the familiar formulas by equating the real and imaginary parts of 
both sides: 


cos 20 = cos? 0 — sin? @ = 2 cos? 0 — 1, 
sin 20 = 2 cos ĝ sin 8. 


This shows that complex methods often simplify the derivation of real 
formulas. m 


3.6 Roots 


If z = w” (n = 1,2,...), then to each value of w there corresponds one value 
of z. Conversely, to a given z # 0 there correspond precisely n distinct 
values of w. Each of these values is called an nth root of z, and we write 


w = are 
Hence this symbol is multivalued, namely, n-valued, in contrast to the 


usual conventions made in real calculus. The n values of /z can easily be 
obtained as follows. In terms of polar forms for z and 


w = R(cosġ + isin), 


the equation w” = z becomes 


w” = R” (cosnọ + isinne) = z = r(cosé + isin 0). 


By equating the absolute values on both sides we have 


R” =r, thus R= Vr 
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where the root is real positive and thus uniquely determined. By equating 
the arguments we obtain 


0 2k 
no =0+2kr, thus ¢= Mee 
n n 
where k is an integer. 
For k = 0,1,...,n — 1 we get n distinct values of w. Further integers of k 


would give values already obtained. For instance, k = n gives 2kr/n = 27, 
hence the w corresponding to k = 0, etc. Consequently, 4/z, for z 4 0, has 
the n distinct values 


where k = 0,1,...,n — 1. These n values lie on a circle of radius <r with 
center at the origin and constitute the vertices of a regular polygon of n 
sides. The value of {/z obtained by taking 0 = Arg z and k = 0 in eqn. 3.15 
is called the principal value of w = ⁄z. 


Example 3.14. Square root. 


From eqn. 3.15 it follows that w = \/z has the two values 


w = yr (cos + isin 5) (3.16) 


Wa = vi [eo (i+) +isin ($+) = Sty 


which lie symmetric with respect to the origin. For instance, the square 
root of 4i has the values 


V4i = +2 (cos 7 +7sin =) = +(V2 + i v2). 


and 
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It follows from eqn. 3.16 that if we use the trigonometric identities 


0 1 6 1 
cos 5 = 4/ 5(1 + cos 9), sin 5 = (J 5 (1 — cos8), 


multiply them by yr, 


0 1 0 1 
vr cos 5 = 4/5 (7 +r cos), vrsin 5 = 4 3(" — rcos8), 


use rcos@ = x, and finally choose the sign of Im yz so that 


sign[(Re VZ) (Im y2)] = signy 


such that w = —w,, we obtain the much more practical formula 
1 ; ae 
Vz =+ |4 5(lel +2) + (signy)iy/ 5 (lel -2) 
where sign y = 1 if y > 0, signy = —1 if y < 0, and all square roots of positive 


numbers are taken with the positive sign. 
| 


Example 3.15. Complex quadratic equation. 
Solve 2? — (5+ i)z +8 +i =0. 


Utilizing the quadratic formula and our equation from above, we have 


1 1 i 1 
2=}6+i+ yi6 -8-i= 16+) i+ 10-8 -i 


1 
=5(6+i)+4y6Ṣ4 
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Example 3.16. nth root of unity. Unit circle. 
Solve the equation 2” = 1. 


From the equation for the nth root of a complex number we obtain 


2k 2k 
Views si PHO eet 
n n 


If w denotes the value corresponding to k = 1, then the n values of 7/1 can 
be written as 1,w,w?,...,w”'. These values are the vertices of a regular 
polygon of n sides inscribed in the unit circle (the circle of radius 1 with 
center 0), with one vertex at the point 1. Each of these n values is called 
an nth root of unity. For instance, 1 = 1,-1+ i and Y1=1,i,-1,-i. 


Figure 3.9: V1 Figure 3.10: V1 


If w is any nth root of an arbitrary complex number z, then the n values of 


</z are 


2 n—1 
W1, WIW, WW... , WIW 


since multiplying w, by w* corresponds to increasing the argument of w, 
by 2kr/n. E 
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Example 3.17. Find the five exact algebraic expressions for the 5th roots 
of unity V/1. 


solution: 
First off we know w’ = 1. 
Next, we know w! = cos + isin #, but this is not an algebraic expression. 


Exact values of cos £ and sin # are not commonly known, so we must find 
them. 


We'll start by making the observation 


So if 6 = a we have cos 26 = cos 30. Next derive identities for cos 20 and cos 36 
in terms of cos@. Use De Moivre’s formula and the fact sin? 6 + cos? 0 = 1. 


(cos + isin 0)? = cos? 0 — sin? 0 + (2 cos Osin 6)i 
= cos 20 + i sin 20 
cos 20 = cos? 0 — sin? 0 = 2 cos? 0 — 1 


sin 20 = 2 cos 0 sin 0 


(cos 0 + isin 0)? = (cos? 0 — sin? 6 + (2 cos 6 sin 0)i) (cos 6 + isin 0) 
= cos? 0 — sin? 0 cos 0 — 2 cos 0 sin? 0 
+ (cos? 6 sin 0 — sin? 0 + 2 cos? 6 sin @)i 
= cos 30 + isin 30 


cos 30 = cos? 0 — (1 — cos? 0) cos 0 — 2 cos 0(1 — cos? 0) 
= cos” 0 + cos? 0 — cos 0 — 2 cos 0 + 2 cos? 0 


= 4 cos? 6 — 3 cos 0 


sin 30 = 3 cos? ô sin 8 — sin? 0 
= 3(1 — sin? 0) sin 0 — sin? 0 
= 3 sin 0 — 4 sin? 8 
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So we have cos 20 = 2cos? 0 — 1 and cos 36 = 4cos® 6 — 3cos@ and the condition 
cos 20 = cos 30. 
Replacing cos 4 by z: 
Ag? — 227 — 32 +1 = 0 
(x —1)(4z? + 22 — 1) =0 


We know that x # 1, so we are left to solve the quadratic part: 


—2 + \/2?-4-4.-(-1) 
2.4 
—2 + /20 
L = —— 
8 


H n 


Since z > 0, 


2T —1 + v5 
£ = Cos = : 
5 4 


Now to solve for sin (27). 


We already know 


—1 
sin 20 = 2 sin ĝ cos 0 = 2 sin 8 (=£) 
and 
sin 30 = 3 sin 8 — 4 sin? 0. 
Note that 
4 i 67 
sin | — ] = —sin | — 
5 ? 
so sin 20 = — sin 30 if 0 = a, 


So we have 


(25 


Replacing sin 6 by y: 


4 
5+V75 
4y’ — ge 
jp 
2 
_ J5+v5 
7 8 


So we have 


Figure 3.11: V1 


3.6 Roots 
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Next we calculate w? = w'w!, w3 =w?w!, and wt = w3w!. The results are: 


al 

TR. 5+ V5 
w =a 1475) +i 5 
ol 5- v5 
on ne V5) +4 5 
ns, Neo! 5- V5 
w? = q (it v5) i 5 


3.7 Review Examples 


Example 3.18. (Multiplication by 7) Show that multiplication of a complex 
number by i corresponds to a counterclockwise rotation of the correspond- 
ing vector through the angle 7/2. 


arg (2122) = arg z1 + arg 22 


T 
arg(z1i) = arg z1 + argi = arg zı + 5 


Example 3.19. Find 


a) [1.5 + 2i| = V1.5? + 2? = v6.25 = 2.5 
b) |cos + isin 6| = v cos? 0 + sin? 0 = V1=1 


5 
-7 E Jee 


on 


a 


_|B+% 7+5i 
7—5i 7+5i 


3.7 Review Examples 


G+ | | -8 | | -8i 8-15: 
(1 + 4i)? = a - Sas os 
_ |120-64i] |120 — 64i 
= | Seas =| 289 


N 2 aN oao 
= 989) ` (289) ~~ V 2892 ` 2892 
_ [18496 136 1/17 8 

289? 289 1/17 17 


Example 3.20. Represent in polar form: 


a) 1 +i = V2(cos~ + isin ~) 


4 4 

b) 6 + 8i 
r = /36 + 64 = V/100 = 10 
8 

0 = Arg z = Arctan o 0.927 

6 + 8i = 10(cos 0.927 + i sin 0.927) 
iV2 
c) : 
4+ 4i 


iy2 4-46 4/24+i4/2 4+4i 
4+4 4-4 16416 32/2 
o 4+4 _ 1ti 
= 16⁄2 4v2 


81 


82 Chapter 3 The Complex Field 


i a 
r= z5 + gy = VI/16 = 1/4 


32 
G=7/4 
iv? a T G mi n 
= Cos H2 sın 
4+4i 4 4 4 
2+3 
d 
ai 


2+3 2+32 5—4 22+ 


544 544 5-4 4l 


24+3i E 
- = 0.563(cos 0.308 + i sin 0.308) 


5+ 4i 


Example 3.21. Determine the principal value of the arguments of 


a) -10 — i 


—1 1 
0 = Arctan =a Tt = Arctan T Tt = —3.042 


b) 2+ 27 


T 
0 = Arctan 1 = — 
rctan A 


3.7 Review Examples 


Example 3.22. Represent in the form z + iy: 


3 3 
a) 2v2 cos + isin 
4 4 
2V2 = V8 = V4 + 4 = V 224 22 
|x| = |y| = 2 
3T 
= 4 


xz +iy=—2+2i 
b) cos(—1.8) + isin(—1.8) = —0.227 — 0.974i numerically. 


Example 3.23. Find all values of the following roots. 


a) /—-81 


9+ 2k 6 + 2k 
ye = Vi (cos L T + isin ai z) 


w = yz has two values k = 0,1. 


0 ie 
w= vi (cos 5+ isin 5) 


V—8i = +(2 — 2i) 
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b) V1 
w? = 1 
1 RER 27 ee es 1 1 
w = cos — + isin — = cos — + isin — = | 
8 8 4 4 AY A2 
www! (ti) (ata) 
V2 V2 AVZ v2 
w? = ww = (z+ 1 )=-3+3 
Va V2 V2 V2 
1 1 1 1 
ee wat = ( i) ( i)=-1 
V2 MDP NNI y2 
w3 = ww! = (G i zi) = : a 
V2 v2 v2 V2 
TE E te ae, ale ; 
W =W W = a 2) =-2 
V2 WI NW y2 
V2 y2 V2 y2 
In summary, V1 = +1, +i, +(1 + i)/ v2. 
c) %—1 
\2 . 4 > i>i 
(1 +i) = 2i = (1+i)*= —4 so a fourth root of —1 is o 
1+i 
wi = — 
a9 
The fourth roots of unity are 1,w =i,w? = —1,w? = —i. 
La Boy, es 
We = Ww = i= | i 
=e V2 v2 V2 
1+i1 1 1 
2 x 
w3 = ww’ = =I] s == 
A z” X 
w S a Be i)= : 2 
cae V2 v2 v2 


In summary, V-I = +(1 + i)/ v2. 
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d) V1+7 
1+i: r= v2, 0=7/4 
; 4+2 4+2k 
JTF = oF (cos L i Lg eel - r), k = 0,1,2 
5 E krn .. kr 
YI +i = V2 | cos — +isin — i k = 1,9,17 
12 12 
E 
Example 3.24. Solve the equation z? — (5 + i)z +8 +i = 0. 
Utilizing the quadratic formula and 
1 f al 
Vz =+ z(l2l +x) + (sign y)i 5 (lel -x)| we have 
1 1 1 3 
z= Letia yi6+i}-8-i= OEO 
1 1/5 1/5 
= -(5 +i) -(-2)] +i xea 
56 +8 i (5 +(-2) iy} (5-1-2) 
il ie ee eo 
=e [a+ ={ 2 =i 
a 
Example 3.25. (Parallelogram equality) 
Show that |z1 + 22|? + [z1 — 29|? = 2(J21|? + [z2|?). 
Proof. 
jal? =2t+y7 
|z2|? = 23 + y2 
[z1 + 22]? = (£1 + £2)” + (yı + y2)” = x1 + £3 + 2z122 + yt + Yz + 2yry2 
|z1 — 22 i (zı = T2)” + (i= Yo)? = ei 2g 5 — 2222 yi y2 — 2y1y2 
|a + zl? + |21 za? = 221 + 2r3 + Qy? + 2y5 


(lea)? + |221") 
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3.8 Topological Aspects of the Complex Plane 


Sequences and Series. The concept of absolute value can be used to define 
the notion of a limit of a sequence of complex numbers. 


Definition 3.1. The sequence 2, z2, z3, ... converges to z if the sequence of 
real numbers |z„ — z| converges to 0. That is, zn > z if |z, — z| > 0. 


Geometrically, z, > z if each disc about z contains all but finitely many of 
the members of the sequence {z,,}. 


Since 

| Re z|,|Imz] < |z| <|Rez|+|Imz| 
Zn > z if and only if Rez, > Rez and Im zn > Imz. 
Example 3.26. Limits of sequences. 


a) 2” > Oif |z| < 1 since |z” — 0| = |z"| > 0. 
b) — > 1 since 
n+? 


—i n—i 


Definition 3.2. {z,,} is called a Cauchy sequence if for each « > 0 there 
exists an integer N such that n,m > N implies |z, — zm| < €. 


Proposition 3.1. {z,,} converges if and only if {z,} is a Cauchy sequence. 


Proof. If zn > z, then Rez, — Rez, Imz, — Imz and hence {Rez,,} and 
{Im zn} are Cauchy sequences. But since 
[Zn — m| < | Re(zn — 2m)| + | Im(én — 2m)| 
= | Rez, — Re 2m| + | Im zn — Im zp, 


{zn} is also a Cauchy sequence. 
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Conversely, if {z,,} is a Cauchy sequence so are the real sequences {Re z,,} 
and {Im zn}. Hence both {Re z,} and {Im z,,} converge, and thus {z,,} con- 
verges. 


oO 


An infinite series D z, is said to converge if the sequence {s,,} of partial 


k=1 
sums, defined by s,, = 21 + 22 +-+- + Zn, converges. If so, the limit of the 
sequence is called the sum of the series. The basic properties of infinite 
series listed below will be familiar from the theory of real series. 


i. The sum and the difference of two convergent series are convergent. 


oO 
ii. A necessary condition for D zę to converge is that z, > 0 as n > o. 
k=1 
oO OO 
iii. A sufficient condition for D z, to converge is that ` |z| converges. 


k=l k=1 
[0,6] OO 


When `> |z,| converges, we will say > zę is absolutely convergent. 
k=1 k=1 


Property (iii), which will be important in later chapters, follows from Propo- 


sition 3.1. For if X [zl converges and t, = |z1| + |z2| + --- + |z,| then 
k=1 
{tn} is a Cauchy sequence. But then so is the sequence {s,,} given by 


Sn = 21 +22+---4+ Zn, Since 


[Sm = Sn | = |Zn+1 + Zn+2 p.e + 2ml 
< Jenal + [znl +- [eml = litm — tnl 


by the triangle inequality. Hence ` Z, converges. 
k=1 
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Example 3.27. Convergence and divergence. 


i ; 
a) 2 ga z; Converges since 


jE qk k2 t kzi" =, gktl 
kotil |k?+a kei] | kA4] 
k2 \? ‘ee 1 
~ (aa) (a) Vki] 
and since 3 l converges 
— Sk +1 cad 
ae. i 
b) >, ea diverges, since 
E Sai which implies that SR : diverges |_| 
k+i k2 +17 p E AET Be 


Classification of Sets in the Complex Plane. We give some definitions relat- 
ing to planar sets. D(zo;r) denotes the open disc of radius r > 0 centered 
at zo; i.e., D(zo;r) = {z: |z — z0| <r}. 

D(z;r) is also called a neighborhood (or r-neighborhood) of zo. 


C(z;r) is the circle of radius r > 0 centered at zo. 


A set S is said to be open if for any z € S, there exists ô > 0 such that 
D(z;6) CS. 


S =C\S denotes the complement of the set S; i.e., S = {z €C:2¢S}. 


A set is a closed set if its complement is open. Equivalently, S is closed if 
{zn} C S and z, > z imply z € S. 


OS, the boundary of S, is defined as the set of points whose 6-neighborhoods 
have a nonempty intersection with both S and S, for every ô > 0. 
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S, the closure of S, is given by S$ = S U ôS. 
S is bounded if it is contained in D(0; M) for some M > 0. 
Sets that are closed and bounded are called compact. 


S is said to be disconnected if there exist two disjoint open sets A and B 
whose union contains S while neither A nor B alone contains S. If S is not 
disconnected, it is called connected. 


[z1, 22] denotes the line segment with endpoints z, and 22. 
A polygonal line is a finite union of line segments of the form 


[Zo, zı] U [z1, zə] U---U Ease Zn). 


If any two points of S can be connected by a polygonal line contained in S, 
S is said to be polygonally connected. 


Figure 3.12: A polygonally connected set. 


It can be shown that a polygonally connected set is connected. The con- 
verse, however, is false. For example, the set of points z = x + iy with y = x? 
is clearly connected but is not polygonally connected since the set contains 
no straight line segments. In fact there are even connected sets whose 
points cannot be connected to one another by any curve in the set. On the 
other hand, for open sets, connectedness and polygonal connectedness are 
equivalent. 


Definition 3.3. An open connected set will be called a region. 
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Proposition 3.2. A region S is polygonally connected. 


Proof. Suppose S is an open connected set and z € S. Let A be an open 
subset of points of S which can be polygonally connected to z in S. Since 
A is open any point z in A can be polygonally connected to any other 
point in D(z; ĉo) for some ôo > 0. Now consider another point z, € S such 
that D(z; ôo) O D(z1;61) is nonempty for some 69,6, > 0. Clearly z, can 
be polygonally connected to z) and, in fact, any point in D(z,;5,) can be 
polygonally connected to any point in D(zọ;ôo). Continuing this chain 
of intersections to include all points of S, we conclude A = S and S is 
polygonally connected. Since every point in S can be polygonally connected 
to an arbitrary zo, every pair of points can be polygonally connected to each 
other in S. 


Continuous Functions. 


Definition 3.4. A complex valued function f(z) defined in a neighborhood 
of zo is continuous at zo if zn > zo implies that f(z,) > f(z). Alternatively, f 
is continuous at 2z if for each e > 0 there is some 6 > 0 such that |z — zo| < 6 
implies that | f(z) — f(zo)| < €. f is continuous in a domain D if for each 
sequence {z} C D and z € D such that z, > z, we have f(z,) > f(z). 


If we split f into its real and imaginary parts 


fle) = f(x,y) = ula, y) + iv(z,y), 


where u and v are real-valued, it is clear that f is continuous if and only if u 
and v are continuous functions of (x, y). Thus, for example, any polynomial 


P(z,y) = p3 `> akt y! 


j=1 k=1 


is continuous in the whole plane. Similarly 


1 £ Y 
= a 
z HY ey 
is continuous in the "punctured plane" {z : z # 0}. It follows also that 


the sum, product and quotient (with nonzero denominator) of continuous 
functions are continuous. 
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We say f € C” if the real and imaginary parts of f both have continuous 
partial derivatives of the n-th order. 


A sequence of functions { f„} converges to f uniformly in D if for each e > 0, 
there is an N > 0 such that n > N implies |f,,(z) — f(<)| < e for all z € D. 
Again, by referring to the real and imaginary parts of {fn}, it is clear that 
the uniform limit of continuous functions is continuous. 


Theorem 3.1. M-Test 


Suppose fk is continuous in D, k =1,2,.... If |f,(z)| < M; throughout D 
and if 3 M, converges, then S f,(z) converges to a function f which is 


o k=l k=1 
continuous in D. 


Proof. The convergence of >. fx(z) is immediate. Moreover, for each e > 0, 


k=1 
we can choose N so that 


to ~ > falz)| = 


for n > N. Hence the convergence is uniform and f is continuous. 


ye) 


k=n+1 


< Maa Mago se <E 


oO 


Example 3.28. f(z DA kz" is continuous in D : |z| < 4 5 Since 
k=1 
kijat d w 
eee |) < zk in Dan ` T converges. E 
k=1 


A continuous function maps compact/connected sets into compact/con- 
nected sets. None of the other properties listed above, though, are pre- 
served under continuous mappings. For example, f(z) = Rez maps the 
open set C into the real line which is not open. The function g(z) = 1/z 
maps the bounded set: 0 < |z| < 1 onto the unbounded set: |z| > 1. 


Most of the key results in subsequent chapters will concern properties of 
a certain class of functions defined on a region. We note that we could 
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show that any two points in a region can be connected by a polygonal line 
containing only horizontal and vertical line segments. For future reference 
we will introduce the term "polygonal path" to denote such a polygonal 
line. 


Theorem 3.2. Suppose u(x, y) has partial derivatives u, and u, that vanish 
at every point of a region D. Then u is constant in D. 


Proof. Let (1, yı) and (x2, y2) be two points of D. Then they can be connected 
by a polygonal path that is contained in D. Any two successive vertices 
of the path represent the end-points of a horizontal or vertical segment. 
Hence, by the Mean-Value Theorem, the change in u between these vertices 
is given by the value of a partial derivative of u somewhere between the 
end-points times the difference in the non-identical coordinates of the 
end-points. Since, however, u, and u, both vanish in D, the change in u is 
0 between each pair of successive vertices; hence u(x1, y1) = u(x2, y2). 


3.9 Stereographic Projection; The Point at Infinity 


The complex numbers can also be represented by the points on the surface 
of a punctured sphere. Let 


1 1 
p= {eng ieanr ees ahh (3.17) 


that is, let © be the sphere in Euclidean (€,7,¢) space with distance + from 
(0, 0, +). Suppose, moreover, that the plane ¢ = 0 coincides with the complex 
place C, and that the € and ņ axes are the x and y axes, respectively. To 
each (€,7,¢) € E we associate the complex number z where the ray from 
(0,0, 1) through (£, n, C) intersects C. This establishes a 1-1 correspondence, 
known as a setereographic projection, between C and the points of © other 
than (0,0, 1). 


Formulas governing this correspondence can be derived as follows. Since 
(0,0,1), (€,7,¢) and (2, y,0) are collinear, 
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(x, y, 0) 


Figure 3.13: The Riemann sphere 
and stereographic projection. 


so that 
Š 2 (3.18) 


Y= ae 


mE 1=¢ 
The equations 3.17 and 3.18 can be solved for €,7,¢ in terms of x,y as 
follows. By substitution we solve for ¢, 


2 
elt) ela (ye (¢-5) = 
al (y= CP + (e-¢+3) = 

4) 4 


et)... 
<7 


z?’ +y’ + 


E ee (3.19) 
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Now it is a fact that 


x+y? 
w2+y24+1 
oe = r? i y? 
1— xr?+y2+1 
So therefore, 
be z? + y? 
a z2 + y? + 1 
x 
= 7(1 -= — > 3.20 
yY 
pay) Ppt 


Now suppose that {o;,} = {(&, Nk, ¢,)} is a sequence of points of © which 
converges to (0,0,1) and let {z,} be the corresponding sequence in C. By 
3.18 and 3.19, 


Bais ga 
faa = Ç 


T= Ter 


Z| > œ. Conversely, it follows from 3.20 that if 


so that as o; — (0,0,1), 
|z,| = œ, o, — (0,0,1). 


Loosely speaking, this suggests that the point (0,0,1) on © corresponds 
to co in the complex plane. We can make this more precise by formally 
adjoining to C a "point at infinity" and defining its neighborhoods as the 
sets in C corresponding to the spherical neighborhoods of (0,0, 1). While 
we will not examine the resulting "extended plane" in greater detail, we will 
adopt the following convention. 


Definition 3.5. We say {z} > © if |z| > oo, i.e., |z| > oo if for any M > 0, 
there exists an integer N such that k > N implies |z;,| > M. Similarly, we 
say f(z) > œ if |f(z)| > oo. 


For future reference, we note the connection between circles on © and 
circles in C. By a circle on £, we mean the intersection of © with a plane of 
the form A£ + Bn + CC = D. According to 3.20, if S is such a circle and T is 
the corresponding set in C, 


Ax + By + C(2? +y?) = D(z? +4? +1) 
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or 
(C — D)(x? + y*) + Ax + By = D (3.21) 


for (x,y) € T. Note that if C 4 D, 3.21 is the equation of a circle. If C = D, 
3.21 represents a line. Since C = D if and only if S intersects (0,0, 1), we 
have the following proposition. 


Proposition 3.3. Let S be a circle on © and let T be its projection on C. 
Then 


a. If S contains (0,0,1), T is a line; 
b. If S does not contain (0,0,1), T is a circle. 


Proof. Plugging (0,0,1) into A£ + Bn + CÅ = D yields C = D so eqn. 3.21 
becomes the equation of a line which is equivalent to a circle of infinite 
radius. If S does not contain (0,0,1), C # D and eqn. 3.21 becomes the 
equation of a circle. 


3.10 Curves and Regions in the Complex Plane 


Circles and disks. The distance between two points z and a is |z — a|. Hence 
a circle C of radius p and center at a can be given by 
|z -—al =p (3.22) 
In particular, the unit circle, that is, the circle of radius 1 and center at the 
origin a = 0, is 
kz|=1. 


Furthermore, the inequality 


|z-—al<p (3.23) 


holds for every point z inside C; that is 3.23 represents the interior of C. 
Such a region is called a circular disk, or, more precisely, an open circular 
disk, in contrast to the closed circular disk 


P= aS p, 


which consists of the interior of C and C itself. The open circular disk 3.23 
is also called a neighborhood of the point a. 
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Obviously, a has infinitely many such neighborhoods, each corresponding 
to a certain value of p > 0, and a is a point of each such neighborhood. 


Similarly, the inequality 


|z—al > p 
represents the exterior of the circle C. Furthermore, the region between 
two concentric circles of radii pı and p2(> pı) can be given by 
pi < |z—al < pa, 


where a is the center of the circles. Such a region is called an open circular 
ring or open annulus. 


Example 3.29. |z — 3+ i| < 4 is valid for all z whose distance from a = 3 —i 
does not exceed 4. Hence this is a closed circular disk of radius 4 with 
center ar 3 — i. ia 


Half planes. By the (open) upper half-plane we mean the set of all points 
z = x + iy such that y > 0. Similarly, the condition y < 0 defines the lower 
half-plane, x > 0 the right half-plane, and z < 0 the left half-plane. 


Example 3.30. Determine the sets represented. 


a) |z —4i|=4 
Circle, radius 4, center 4i. 


1 
Annulus with center a. 


c) 0 < Rez < 7/2 
Vertical infinite strip. 


d) |z — 1| < |z+1]| 
Right half-plane. 
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3.11 Exercises 


1. Express in the form a + bi. 


1 
a. 
6 + 22 
b (2 +7)(3 + 27) 
` 1-i 
4 
c EM 
2 2 


d. 2, 13,14, i 
2. Find (in rectangular form) the two values of //—8 + 6i. 
3. Solve the equation 2? + /32iz — 6i = 0. 
4. Prove the following identities: 


zı + 22 = 21 + 22. 


a. 
b. 2122 = 21° 29. 


c. P(z) = P(zZ), for any polynomial P with real coefficients. 
ds 7=2) 
5. Suppose P is a polynomial with real coefficients. Show that P(z) = 0 if 


and only if P(z) = 0 (i.e., zeros of real polynomials come in conjugate 
pairs). 


6. Verify that |z?| = |z|? using rectangular coordinates and then using 
polar coordinates. 


7. Show 
a |e" |= |e)". 
be |z = zz: 


c. |Rez|,|Imz| < |z| < |Rez|+|Imz| 
(when is equality possible?) 
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8. 


10. 


ll. 
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a. Fill in the details of the following proof of the triangle inequality: 


[z1 + zal? = (21 + 22)(% + 2) 
= jal? + lal? + 212 + Az 
= |z|? + |z2|? + 2 Re(z122) 


< [a]? + [z217 + 2]z1||ze| 


b. When can equality occur? 


c. Show: |z = zal > [zıl = | zo|. 


. It is an interesting fact that a product of two sums of squares is itself 


a sum of squares. For example, 
(1? + 27)(3? + 47) = 125 = 5° + 10? = 2? + 11°. 


a. Prove the result using complex algebra. That is, show that for any 
two pairs of integers, {a,b} and {c,d}, we can find integers u,v with 


(a? +PP +d’) =w+v" 


b. Show that, if a,b,c,d are all nonzero and at least one of the sets 
{a?,b*} and {c’, d*} consist of distinct positive integers, then we can 
find u?,v? as above with u? and v? both positive. 


c. Show that, if a,b,c,d are all nonzero and both of the sets {a”,b?} 
and {c’,d?} consist of distinct positive integers, then there are two 
different sets {u?,v?} and {s*,¢?} with 


(a? + be + d’) =u a =s He. 


d. Give a geometric interpretation and proof of the results in b) and c) 
above. 


Prove: |a + z|? + |z1 — 22|? = 2(\z1/? + |z2|”?) and interpret the result 
geometrically. 


Let z = x + iy. Explain the connection between Arg z and tan~!(y/z). 
(Warning: they are not identical.) 


12. 


13. 


14. 


15. 
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Solve the following equations in polar form and locate the roots in the 
complex plane: 


c. 24 = —1 + vV3i. 


Show that the n-th roots of 1 (aside from 1) satisfy the "cyclotomic" 
equation 2"~! +2”? +---+z+1=0. [Hint: Use the identity z” — 1 = 
(z = 1) (20t + 27? +- 1).] 


Suppose we consider the n — 1 diagonals of a regular n-gon inscribed 
in a unit circle obtained by connecting one vertex with all the others. 
Show that the product of their lengths is n. [Hint: Let the vertices all 
be connected to 1 and apply the previous exercise.] 


Describe the sets whose points satisfy the following relations. Which 
of the sets are regions? 


a. |z—i| <1. 
—1 

by Cli; 
z+1 


z—2|> |z- 83l. 


d. |z| < 1 and Imz > 0. 
Pan 

e. = =z. 
z 

f. |z|? = Imz. 


g. |z? — 1| < 1. [Hint: Use polar coordinates.] 


16. Identify the set of points which satisfy 


a. |z| = Rez +1. 


b. |z- 1|+|z+1|= 4. 


c. 2 =Z. 
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17. 


18. 


19. 


20. 


21. 


22. 
23. 


24. 


Chapter 3 The Complex Field 


Let Arg w denote that value of the argument between —r and 7 (inclu- 
sive). Show that 


A z—=1\_ f x7/2 if Imz >0 
ERSEN a —r/2 if Imz<0 


where z is a point on the unit circle |z| = 1. 


Find the three roots of r? — 6x = 4 by finding the three real-valued 
possibilities for ¥/2 + 2i + ¥/2 — 2i. (Note: You can find the three cube 
roots of 2 + 2i, or you can simplify the problem by first applying the 
identity: a + b = (a? + b?) /(a? — ab + b°)). 


Prove that zx? + px = q has three real roots if and only if 4p? < —27q’. 
(Hint: Find the local minimum and local maximum values of x?+-pxr—q.) 


a. Let P(z) = 1+ 2z +32? -+nz"—', By considering (1 — z)P(z), show 
that all the zeros of P(z) are inside the unit disc. 


b. Show that the same conclusion applies to any polynomial of the 
form: ao + aiz + a22? + --- + anz”, with all areal and 0 < ap < a; < 
nai Rehr: 


Show that 
a. f(z) =o, kz" is continuous in |z| < 1. 
b. g(z) = Xz 1/(k? + z) is continuous in the right half-plane Re z > 0. 
Prove that a polygonally connected set is connected. 
Let 
. 1 
S= {etiyse Dore >0.y=sinz}, 
x 


Show that S is connected, even though there are points in S that 
cannot be connected by any curve in S. 


Let S = {(€,7,¢) E€ & : ¢ > Gy} where 0 < ĉo < 1 and let T be the 
corresponding set in C. Show that T is the exterior of a circle centered 
at 0. 


25. 


26. 


27. 


28. 


3.11 Exercises 101 


Suppose T C C. Show that the corresponding set S C X is 


a. acircle if T is a circle. 
b. a circle minus (0,0, 1) if T is a line. 


Let P be a nonconstant polynomial in z. Show that P(z) => œ as 
z> œ. 


Suppose that z is the stereographic projection of (£, n, Ç) and 1/z is the 
projection of (€',77’,¢’). 


a. Show that (E, n’, C) = (E, -—n, 1 — Å). 


b. Show that the function 1/z, z € C, is represented on » by a 180° 
1 1 these I 

rotation about the diameter with endpoints eee 0, 5) and (5) 0, 5): 

Use exercise (27) to show that f(z) = 1/z maps circles and lines in C 


onto other circles and lines. 


